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The  enclosed  report  presents  a thorough  study  of  the  dif- 
ferential equation 


mx  + cx  + kx  = Df(x,x)  , 

where  f is  a discontinuous  function  which  assumes  only 
the  values  ^1.  The  study  shows  how  f should  be  defined 
in  order  that  any  error  in  (x,x),  that  is,  any  deviation 
from  (0,0),  will  vanish  in  a minimum  of  time.  Curves  are 
drawn  in  the  xx  phase  plane  showing  where  f should 
"switch"  its  value  from  >1  to  -1  or  vice  versa.  The  work 
should  be  of  interest  to  people  concerned  with  the  design 
of  servomechanisms  and  similar  devices. 

The  report  was  written  as  a dissertation  for  a Doctor  of 
Philosophy,  Department  of  Mathematics,  Princeton  University. 
ITie  work  involved,  however,  was  a part  of  the  research  pro- 
gram under  the  Office  of  Naval  Research,  Limit  Control 
project.  Contract  Nonr-26302,  NR  3U1-009  at  the  txpori- 
raental  Towing  Tank,  Stevens  Institute  of  Technology.  It  is 
therefore  being  published  by  the  latter  organization  as  a 
technical  report  on  the  project. 
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SUMMARY 


In  many  problems  arising  in  connection  with  designing  servomechanisms 
and  similar  systems  one  has  to  solve 


m -»■  c ^ + kx  = -D 

d?  * 


with  the  discontinuous  forcing  term  -D  , and  to  find  a transition  or 
switching  curve  in  the  phase  plane  so  that  solutions  reach  the  zero  state 
in  minimum  time. 


This  problem  arises  in  designing  the  systems  so  as  to  obtain  optimum 
performance,  and  is  therefore  of  considerable  practical  as  well  as  mathe- 
matical interest;  but  very  little  work  has  previously  been  done  on  it,  and 
that  mainly  from  a physical  rather  than  mathematical  point  of  view. 

In  this  thesis  there  is  given  a complete  solution  of  this  problem. 

The  treatment  is  much  more  lengthy  than  one  would  expect,  because  it  has 
been  necessary  to  break  the  problem  up  into  cases;  none  of  the  general 
methods  available  in  the  literature  apply.  The  final  result  is  given  in 
Theorem  10,  page  63 . 

In  addition  we  have  recalled  the  situation  that  occurs  when  the 
switching  curve  is  linear.  We  have  also,  at  the  end,  touched  upon  certain 
problems  that  arise  when  the  equation  is  nonlinear,  notably  of  the  ■randerPol 
type.  This  case,  however,  is  so  much  more  complicated  than  the  linear  case 
that  no  attempt  has  been  made  to  give  a complete  solution. 
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I.  INTRODUCTION 


A simple  exairple  will  serve  to  illustrate  the  kind  of  physical  situa- 
tion in  which  the  central  problem  of  this  paper  arises.  Consider,  there- 
fore, a servomechanism  which  consists  of  a motor,  a source  of  power,  and  a 
feedback  circuit.  We  shall  suppose  that  the  purpose  of  the  servomechanism 
is  to  hold  the  output  of  the  motor  constant,  say  at  the  value  y^  , where 
to  fix  the  ideas  we  shall  suppose  that  the  output  of  the  motor  is  measured 
in  terras  of  the  angle  y of  the  rotor  with  respect  to  some  fixed  reference 


y - yo 


position.  The  feedback  circuit  is  sensitive  to  errors  in  y , i.e.,  to  the 
quantity  y - , and  its  object  is,  upon  sensing  such  an  error,  to  apply 

to  the  motor  an  input  the  effect  of  which  will  be  to  tend  to  nullify  this 
error.  The  input  to  the  motor  is  denoted  by  f , and  will  be  supposed  to 
depend  on  x = y - y^  and  the  first  derivative  with  respect  to  time  of  this 
quantity. 

The  differential  equation  for  such  a system  can  be  taken  to  be 


or , equivalent ly , 


I 


K-f 


(1) 


where  I is  a constant  representing  the  moment  of  inertia  of  the  rotor, 
R is  a constant  representing  various  sources  of  energy  dissipation  in 
the  system,  and  K is  the  "torque  constant."  For  a given  motor  (so 


-a- 


See,  e.g.,  James,  Nichols,  and  Phillips;  Theory  of  Servomechanisms 
(19U7),  p.  111. 


that  I , R , K are  fixed)  the  crucial  element  in  this  equation  is  nat- 
urally the  function  f , which  is  determined  by  the  design  of  the  feedback 
circuit  and  by  the  strength  of  the  power  source.  , 

The  problem  to  be  considered  here  arises  when  one  puts  a further  as- 
suiiiption  on  f , namelj'-  the  assumption  that  this  function  can  take  on  only 
the  two  values  -g  , where  3 is  a certain  positive  constant.  This  situa- 
tion occurs  when  the  feedback  circuit  acts  simply  as  a switch  (relay)  which 
applies  to  the  motor  the  full  strength  of  the  power  source  either  directly 
(+3)  or  after  inverting  its  polarity  (-3).  This  scheme  has  the  prima  facie 
advantages  that  (1)  the  feedback  circuit,  since  it  no  longer  needs  to  yield 
a continuously  varying  output,  can  be  vastly  simplified;  and  (2)  it  would 
seem  likely  that,  by  always  using  the  full  strength  of  the  power  source, 
one  should  be  able  to  smash  any  transient  errors  to  zero  more  rapidly  than 
by  any  other  means.  For  the  first  reason,  such  servos  are  in  fact  exten- 
sively used;  but  it  seems  to  be  the  opinion  of  many  experts  that  the  second 
reason  is  not  sound,  for  such  an  intense,  "bang-bang"  servo  is  too  crude  to 
give  a delicate  response,  and  is  prone  to  display  several  kinds  of  highly 
undesirable  behavior:  high-frequency,  low-amplitude  oscillations  ("chat- 
tering"), medium-frequency  oscillations  of  constant  or  increasing  amplitude 
("hunting"),  and  others.  But  naturally  all  this  depends  on  the  character 
of  f , and  there  is  no  reason  to  deny  the  possibility  that  there  may  exist 
some  function  f or  class  of  such  functions  which  would  avoid  these  un- 
pleasant phenomena  and,  in  fact,  give  excellent  performance.  It  will  be 
proved  below  that  there  do  exist  such  functions  and  that  there  even  exists 
a unique  such  function  f which  gives  the  best  possible  performance,  where 
"best  possible"  has  a certain  natural  and  definite  meaning.  This  will  in- 
deed be  shown  for  the  more  general  equation 


d^x  , dx 
m — 7T  + r + cx 
,.2  at 
dt 


(2) 


/ dx  \ + 

where  m , r , c , and  D are  constants  and  f " "1*  This  equa- 

tion can  be  put  into  one  of  Lhe  three  simple  forms 


R-1+69 
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(2') 


(2") 


(2IM) 


by  choosing  the  proper  units  for  time  (t)  and  error  (x),  according  as  c 
is  positive,  zero,  or  negative.  (The  constants  m and  D are  assumed  to 
be  different  from  zero;  it  is  nc  restriction  of  the  generality  to  suppose 
that  they  are  both  positive.)  For  instance,  to  get  (2')  from  (2)  when 
c > 0 , put 


X 


b 


1 

2 


rD 


The  problem  which  will  be  treated  in  this  paper  is; 
In 


(3) 


where  g(x,y)  is  ^ given  function  of  class  C and  <p  (x,y)  is  a_  function 

which  assumes  only  the  values  -1  , how  should  (;^{x,y)  ^ chosen  so  that 

the  solution  of  (3)  for  any  set  of  initial  conditions  (x^,x^)  ^x  = — ^ 
reaches  the  state  x = 0,  x = 0 , and  in  fact  reaches  it  in  less  time  than 
for  any  other  choice  of  cf}  (x,y)  ? 


(in  terms  of  our  mechanical  example,  this  means;  how  should  the  feed- 
back circuit  be  designed  so  that  if  the  output  suddenly  undergoes  a dis- 
turbance which  results  in  a certain  error  and  rate  of  change  of  error,  then 
these  two  quantities  are  brought  back  to  zero  simultaneously  and  as  rapidly 
as  possible?) 


A restricted  form  of  the  problem  has  been  thoroughly  studied  by  a 
group  of  people  represented  by  I.  Flugge-Lotz  and  K.  Klotter  (see  the 


R-ii69 
- k - 


bibliography)  | they  treat  the  equation  ( 2 ' ) with  0 < b < 1 and  assume  that 
has  the  special  form 

4>  (x,y)  =■  sgn(Kx  + My) 

where  sgn  is  the  function  whose  value  is  +1  for  positive  argument  and 
-1  for  negative  argument,  and  K , M are  constants.  Their  results  will 
be  outlined  in  II,  where  it  will  appear  that  in  this  case  the  problem  as 
stated  above  is  insoluble  and  that  the  problem  of  choosing  the  "best"  , 
i.e.,  the  "best"  values  for  K and  M,  becomes  one  of  avoiding  as  many 
undesirable  phenomena  as  possible.  The  "best"  (p  so  determined  indeed  de- 
pends for  its  efficacy  on  the  assumption  that  the  equation  (2')  gives  an 
essentially  incomplete  description  of  the  physical  situation,  that  in  fact 
"time  lags"  occur.  D.  McDonald  (McDonala  (1))  has  discussed  the  problem 
for  the  equation  (2")  with  (p  again  general  and  has  stated  the  correct  re- 
sult for  this  case,  on  the  basis  of  a heuristic  argiiment.  Except  for  these, 
no  results  have  been  given  for  the  problem  stated  above. 

The  first  principal  new  result  of  this  paper  will  be  Theorem  1,  which 
greatly  restricts  the  class  of  functions  (p  (x,y)  which  one  needs  to  con- 
sider in  seeking  a solution;  then,  on  the  basis  of  this  theorem,  the  problem 
is  solved  for  all  linear  g(x,y)  . After  this  something  is  said  about  the 
nonlinear  case,  and  the  paper  concludes  with  brief  discussions  of  some  dis- 
tinct but  closely  related  problems. 

It  should  be  remarked  that  the  problem  admits  of  various  generaliza- 
tions, for  none  of  which  significant  results  are  known.  For  example,  one 
can  consider  a higher  order  differential  equation,  say  of  order  n (n  > 2) 

and  require  that  on  any  solution  the  quantities  x , (k  < n) 

dt^ 

should  at  some  instant  vanish  simultaneously  and  in  the  shortest  possible 
time;  or  one  might  consider  systems  of  equations,  each  involving  a dif- 
ferent function  of  the  type  (x,y)  , for  exanple: 


R-ii69 
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(This  would  represent  the  problem  for  a mechanical  system  with  two  degrees 
of  freedom  with  coupling.)  Problems  of  a different  character  arise  when 
one  remains  with  the  equation  (3)  but  supposes  that  the  externally  caused 
errors  are  not  of  the  simple,  square-wave  type  we  have  considered,  but  of 
j some  more  intricate  but  statistically  describable  type.  In  view,  however, 

of  the  difficulties  involved  in  dealing  with  the  simpler  problem  here  dis- 
j cussed,  the  possibility  of  obtaining  significant  general  results  for  the 

more  complicated  problems  seems,  at  present,  rather  remote. 

I 

''  Terminology 

j|  The  equation  (3)  is  equivalent  with  the  system 

dx 

ii  ^ ^ 

0^) 

ij  ^ = <^(x,y)  - g(x,y) 

lj  where  g(x,y)eC  and  (x,y)  - -1. 

Suppose  for  the  moment  that  (x,y)  = +1  . Then  the  system  (li)  has 
i;  a unique  solution  through  every  point  planej  in  other  words, 

the  family  of  curves  defined  by  (!;)  witn  (p  (x,y)  s +1  covers  the  entire 
plane  exactly  once.  This  family  of  curves  will  be  called  the  P-system 
■ (P  for  positive),  its  curves  P-curves,  and  the  arcs  of  its  curves  P~arcs. 

In  the  same  way,  when  <p  (x,y)  S -1,  one  gets  another  family  of  curves 

' ■ covering  the  planei  this  is  the  N-system  (N  for  negative)  and  N-curves  and 

N-arcs  are  defined  correspondingly, 
i . 

I 

A positive  direction  of  motion  (an  orientation)  on  each  P-  or  N-curve 
is  automatically  defined  in  terms  of  t , 

Now  if  ^ (x,y)  is  allowed  to  be  as  before,  i.e.,  any  single-valued 
I function  whose  domain  is  the  entire  plane  and  whose  range  is  confined  to 

! . the  values  of  -1  , the  classical  theory  of  differential  equations  does  not 

provide  a definition  for  the  notion  of  a solution  of  (h).  If  (x,y)  is 

I 

] simple  enough  it  becomes  clear  how  such  a solution  should  be  defined;  but 

when  (p  (x,y)  is  general  this  is  not  so  clear,  and  some  care  must  be  taken. 

! 


The  definition  which  follows  seems  best.  It  is  expressed  mainly  in  geometri- 
cal terms,  but  the  translation  to  analytical  language  is  easy. 

Suppose  that  ^ is  the  poini,  from  which  the  solution  is 

sought,  and  suppose  - +1  , Then  one  of  the  three  following  mu- 

tually exclusive  possibilities  must  be  realized; 

(i)  There  exists  a P-arc  beginriing  at  p^  of  positive  length  along 
wnich  <f)  (x,y)  = +1  . 

(ii)  The  condition  (i)  is  not  satisfied,  but  there  exists  an  N-arc  from 
p^  along  which  ^ (x,y)  = -1  (excluding  ). 

(iii)  Neither  (i)  nox'  (ii)  holds. 

If  (i),  either  there  exists  a first  poiiit  after  on  the  P-curve 

from  p^  at  which  (x,y)  changes  sign,  or  there  does  nob.  If  there  does, 
the  solution  is  defined  to  begin  with  the  P-arc  from  p^  to  this  point  p^  , 
If  there  does  not,  the  solution  from  p^  is  defined  to  be  that  part  of  the 
F-curve  through  p^  which  follows  p^  (the  P-semi curve  from  Pq  )• 

If  (ii),  the  preceding  paragraph  should  be  applied  with  n in  place 
of  P . 


If  (iii),  no  solution  from  p^  is  defined. 

Cases  (i)  and  (ii)  thus  lead  either  to  a definition  of  the  entire  solu- 
tion from  p^  , or  to  a definition  oj  the  solution  up  to  some  definite  point 
P-,  = (x^,y^)  . In  the  latter  case,  tne  above  process  should  be  repeated. 


with 


p in  place  of  p , the  letters  P 
1 c 


N interchanged,  the  numbers 


-1  interchanged,  ana  the  phrase  "excluding  p^  " aaded  at  the  end  of  (i). 
Tais  will  lead  to  the  same  trichotomy;  either  the  solution  is  not  defined  be- 
yond P-,  , or  it  consists  of  a who]e  N-  or  P sf^mi curve  beginning  at  , or 

it  follows  Pt  with  a -infinite  ai’c  • Then  the  whole  process  should 

be  applied  to  (when  this  pcint  occurs;  other’;; Ise  there  is  nothing  left 

to  do);  but  now  P , N . and  -1^ should  be  in  their  original  places.  This 
either  accounts  for  the  rest  of  the  solution,  or  leads  to  a point  p^  like 


To  be  precise,  let  x(t)  , y(t)  represent  the  P-curve  such  that  x(0)  x , 
y(0)  - y , and  3et  t ..  inf  (tl  t^  0 , <^>  [ x(  t) , ,,  it )]  = -l}  . Then  ° 

the  "firSt  point"  mentioned  is  (x(  t ),y(  t ))  , 
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i 

1 

1 


which  should  be  treated  like  ; and  so  on.  The  whole  curve  obtained 
in  this  way  is,  by  definition,  the  solution  of  (U)  from  « 

If  *^*(x^,y^)  = -1  , the  solution  from  p^  is  dr;fined  correspondingly. 

If  each  of  the  points  p (n  = 0,  1,  2, , . . ) is  assigned  to  the  adja- 

^ * 
cent  P-arc  if  ~ adjacent  N-arc  if  ^ (x^,y^)  ~ -1  , 

then  the  solution  of  (li)  from  p^  consists  of  a countable  (possibly  finite 

or  even  vacuous)  well-ordered  sequence  of  alternating  P-  and  N-arcs  such 

that  the  initial  point  of  the  first  arc  is  p^  , the  terminal  point  of  each 

arc  is  the  initial  point  of  the  next,  and  (p  (x,y)  = +1  on  the  P-arcs, 

-1  on  the  N-arcs. 

The  solution  of  (u)  from  a point  , if  it  exists , is  unique.  This 
follows  at  once  from  the  definition  ana  the  fact  that  P-  and  N-curves  are 
unique . 

It  is  also  easy  to  see  that  if  A is  the  solution  of  (U)  from  p , and 
p ' is  any  point  on  A , then  the  solution  from  p ' is  that  part  of  A 
which  follows  p . 

From  these  facts  it  follows  in  turn  that  a solution  cannot  intersect 
itself  at  a point  p unless  it  is  periodic  beyond  p . (Here  a solution 
can  be  "periodic  beyond  a point"  without  being  completely  periodic,  despite 
the  uniqueness,  because  our  solutions  are  defined  only  unilaterally.) 

A point  on  a solution  which  is  the  terminal  point  of  a P-arc  and  the 
initial  point  of  an  N-arc  will  be  called  a PN- corner.  NP  comers  are  de- 
fined analogously. 

II.  LINEAR  SWITCHING 

Linear  switching  occurs  when  <p  (x,y)  = sgn(Kx  + My),  i.e.,  when 
(p{x,y)  = 1 in  one  of  the  half -planes  determined  by  the  line  Kx  + My  = 0 
ana  -1  in  the  other.  This  case  for  the  equation 

^(x,|)  (0.b<l)  {2.) 

at 

* If,  e.g.,  ^ (x  ,y  ) = +1  while  the  two  adjacent  arcs  are  both  N-arcs, 

is  to  be  regarSed  as  a degenerate  P-arc, 


has  been  thoroughly  discussed  in  several  papers  (Fliigge-Lotz  (1),(2); 
FlUgge-Lotz  and  Klotter  (1)).  The  summary  of  their  results  given  in  this 
section  (based  on  Flugge-Lotz  (2),  Chap.  U)  will  serve  the  double  purpose 
of  showing  how  much  can  be  done  with  such  a <p  (x,y)  and  of  displaying 
some  of  the  unwelcome  phenomena  that  can  occur  in  such  problems. 

In  this  case  (U)  becomes 


dx 

dt 


= y 


^ = -X  - 

dt 


2by  + sgn(Kx  + My) 


(5) 


The  constant  b is  taken  as  fixed,  and  the  focus  of  attention  is  the 
pair  of  constants  K , M , We  shall  suppose  that  they  are  both  different 
from  zero;  what  happens  when  either  of  them  vanishes  is  essentially  the 
same  as  what  happens  in  one  of  the  other  cases.  There  will  then  be  four 
cases  to  consider,  as  tabulated: 


o 

A 

o 

V 

o 

A 

I 

III 

A 

O 

II 

IV 

As  is  well  known,  the  equations  for  the  P-  and  N-curves  belonging 
to  (5)  can  be  explicitly  computed;  they  may  be  expressed  in  the  fonn 

x(t)  = e + Be  - 1 

y(t)  = -e  [(b  - ai)Ae^^^  + (b  + ai)Be 
where  a = ^ -yA  " 

A = ^ -^y(O)  + (b  + ai)  [x(0)  ^ l]l  , b = A . 

Wherever  an  equivocal  sign  occurs,  the  upper  sign  pertains  to  the  P-system, 
the  lower  to  the  N-system.  These  equations  represent  spirals  spiralling 
into  the  foci  (-1,  0), 


(6) 


(V) 
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A solution  of  (5)  can  thus  be  obtained  explicitly  in  terms  of  a se- 
quence of  formula  pairs  (6),  defined  on  successive  intervals  of  time,  each 
representing  that  part  of  the  solution  between  two  successive  zeros  of 
Kx  + My» 

Case  I.  In  this  case  the  "switching  line"  Kx  ♦ My  = 0 passes 
through  the  second  and  fourth  quadrants.  To  the  right  of  it,  (x,y) = +1; 
to  the  left,  -1.  Thus  a solution  consists  of  a sequence  of  arcs  of  spirals, 
each  with  its  focus  on  its  own  side  of  the  line  Kx  + My  = 0 and  its  ends 
on  this  line. 

The  condition  for  the  existence  of  a periodic  solution  is  that  there 
should  exist  (say)  a P-arc  of  the  t^pe  described  whose  end-points  are 


CASE  I 


equidistant  from  the  origin^  fo'!"  then,  by  symmetry,  there  exists  an  N-arc 
on  the  other  side  of  the  switching  line  joining  the  same  two  points,  and 
these  two  arcs  together  form  the  periodic  solution.  We  shall  see  below 
that  a periodic  solution  can  occur  in  our  case. 

Let  S„.  S„  be  the  points  on  Kx  + Mv  = 0 where  a ?-  and  K-curve 
P N 

respectively  are  tangent;  let  fL  and  R^.  be  the  last  intersections  pre- 
ceding Sp  and  Sjj  with  the  switching  line  of  the  P-  and  K-curves  through 
these  points.  Sp  and  S^^  are  symmetric  with  respect  to  the  origin,  as 
are  Rp  and  R^^  . Suppose  that  b , K , and  M are  such  that  Rj^.  is  out- 
side the  closed  segment  S^S  ; then  the  situation  is  as  shown  in  Figure  A 
(next  page).  solution  starting- sufficiently  near  the  segment  S^S.,  will 

1 iv 

move  away  from  the  line  in  one  direction  or  the  other  — according  to  the 


side  of  the  line  on  which  its  initial  point  lies  — and  never  return  to  the 

switching  line  at  all.  Since  solutions  can  only  start  near  S-Sv  » nut  can- 

not  cross  it,  the  points  of  this  s-^gment  are  called  start  points.  (Strictly 

speaking,  (x,y)  is  not  oeiined  in  this  segment,  so  that  one  cannot  speak 

of  a solution  starting  on  it.)  Also,  since  any  solution  starting  in  this 

manner  doss  not  return  to  the  switching  line  but  merely  spirals  aown  to  one 

focus  or  the  other,  such  points  are  also  rest  points;  uhe  cotjtrol  represented 

by  the  function  ^ is  at  rest  on  the  solution  from  such  a point.  It  is 

easy  to  see  that  the  points  of  also  rest  points,  but 

not  start  points.  In  general,  irrespective  of  the  values  of  b , K , and 

M (for  Case  I),  the  segments  S„R  and  R..S  consist  of  rest  points  and 

r N 

the  segment  SpS.^  consists  of  start  points,  as  one  can  easily  convince  him- 
self. . 

In  the  case  illustrated  in  Figure  A no  periodic  solution  can  exist; 
for  it  can  be  snown  that  every  P-arc  wnich  lies  to  the  right  of  the  line 
Kx  + Ky  = 0 and  has  its  ends  on  this  line  also  has  the  property  that  its 
terminal  point  is  nearer  the  origin  than  its  initial  point;  thus  the  condi- 
tion for  a periodic  solution  can  never  be  satisfied. 

If,  however,  R_  and  are  on  the  segment  (Figure  B) , then 

there  c-.ists  a periodic  solution.  This  may  be  seen  as  follows;  the  parti- 
cular P-arc  RpSp  oegins  nearer  the  origin  than  it  ends,  by  assumption; 
but  P-arcs  which  begin  sufficiently  far  from  the  origin  on  tne  switening 
line  have  tne  reverse  property;  tnerefore,  by  continuity,  some  intermediate 
P-arc  of  this  tjuje  must  begin  and  end  at  t;ie  same  cistance  from  the  origin; 
and  this  is  exactly  the  condition  for  a periodic  solution.  Extended  analysis 
bears  this  cut,  and  shows  that  the  periodic  solution  is  unique  and  orbitally 
stable.  All  solutions  beginning  outside  the  periodic  solution  spiral  onto 
it,  and  those  solutions  which  begin  inside  the  periodic  solution  but  outside 
the  shaded  area  (which  repi-esents  rest  poii:ts)  also  spiral  onto  it.  When 
there  is  no  periodic  solution,  as  in  Figure  A,  all  solutions  have  fxnitsly 
many  corners  and  then  spiral  down  with  no  further  corners  to  one  of  the  foci 
(-1.  Oj  , Obviously,  none  of  these  kinds  of  behavior  is  welcome  in  terms  of 
the  problem  at  hand. 
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Case  II.  This  case  differs  from  the  first  in  that  the  switching  line 
Xx  + My  = 0 now  passes  through  the  first  and  third  quadrants.  The  arcs 
which  occur  in  solutions  are  as  in  Case  I.  No  periodic  solutions  occur  in 
this  case.  Let  Rp  , flj^,  , Sp  , Sj^  be  defined  as  before;  then  the  points 
Rp  , Sp  , 0 , lie  on  the  switching  line  in  this  order.  The  inter- 
vals RpSp  8-nd  Sj^Rjj  easily  seen  to  consist  of  rest  points;  but  on 

the  interval  SpS^^,  a new  phenomenon  occurs.  Consider  any  solution  which 
reaches  this  interval,  say  at  the  point  £.  What  does  the  solution  do  at 


CASE  II 


this  point?  It  should  have  a PN-corner  at  E , for  it  has  reached  a 
point  where  <f>  changes  sign;  but  the  N--cui?ve  from  E goes  b^-k  into 
the  same  half -plane  from  which  the  solution  entered  E , and  on  this 
side  a solution  can  contain  only  P-arcsj  on  the  other  hand,  the  solution 
certainly  cannot  follow  the  P--curve  through  £ beyond  this  point.  Thus 
the  solution  is  not  defined  beyond  E;  it  ends  at  E . For  this  reason, 
such  a point  is  called  an  end  point.  In  a manner  of  speaking,  end  poi-wts 
are  inverted  start  points  and,  like  start  points  (but  unlike  rest  points) 
can  occur  on  the  switching  line  only. 

Any  solution  starting  outside  the  region  just  considered  spirals  in 
toward  the  origin  until  it  reaches  a point  on  the  interval  ^rRp  ’ beyond 
which  its  behavior  is  detemined  by  the  above  considerations. 


In  practice,  due  to  mechanical  traits  of  the  physical  system  involved 
wtiich  prevent  it  from  obeying  our  idealized  hypotheses  exactly,  there  arises 
a time  lag,  this  means  that  a solution  meeting  the  line  Kx  + My  =*  0 ac- 
tually proceeds  for  some  distance  beyond  it  before  it  has  the  corresponding 
corner.  In  Case  I such  a time  lag,  provided  that  it  is  not  too  large,  does 
not  affect  the  essential  behavior  of  the  system;  such  a system  might  there- 
fore be  said  to  be  "structurally  stable  with  respect  to  time  lags."  But  in 
Case  II  the  presence  of  a time  lag  does  make  a difference;  for  consider  a 
solution  entering  an  end  point;  because  of  the  time  lag,  it  no  longer  ends 
thej'e,  but  proceeds  for  a certain  distance  beyond  and  then  has  a corner, 
where  a solution  is  still  defined.  From  this  corner  it  crosses  the  switching 
line  in  the  reverse  direction,  moves  for  a short  distance  beyond,  has  another 
comer,  and  so  on.  The  successive  intersections  of  such  a solution  with  the 
switching  line  move  away  from  the  origin,  so  that  sooner  or  later  one  of  uhe 
corners  lies  in  the  set  of  rest  points,  and  from  this  comer  the  solution 
proceeds  to  spiral  down,  without  further  corners,  to  the  corresponding  focus. 
(See  the  picture  below.) 


Ihe  situation 
sence  of  time  lags, 


an  Case  II  can  thus  be  summarized  as  follows;  In 
every  solution  either  terminates  in  an  end  point 


the  ab- 
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or  it  eve«tually  spirals  dcwn  to  one  of  the  two  foci  (-1,0).  In  the  presence 
of  a time  lag,  all  solutions  behave  in  the  latter  way.  Thus  Case  II  is  also 
unfavorable  from  our  point  of  view. 

Case  III.  In  this  case  the  switching  line  lies  as  in  Case  II,  but  the 
arcs  which  occur  in  solutions  now  belong  to  spirals  about  the  focus  on  the 
side  of  the  line  opposite  from  the  arc  itself.  In  other  words,  P-arcs  occur 
on  the  left,  N-arcs  on  the  right..  In  this  case  a stable  periodic  solution 
always  exists,  and  it  dominates  the  whole  situation,  for  all  other  solutions 
spiral  onto  it. 
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CASE  III 

That  a periodic  solution  exists  can  be  seen  as  follows;  consider  the 

two  arcs  BC  and  B'C  , where  B lies 
very  near  the  origin;  then 

OC  - OB  > 0 . 

But.  by  the  character  of  the  spirals, 
if  3'  is  far  enough  out 

OC'  - OB”'  < C . 

Thus,  by  continuity,  uhere  must  oe  an 
.intenr.ediute  arc  " such  chat 


OC"  - OT"  ■ 0 ; but  this,  as  pointed  out  earlier  (p.  9)*  is' exactly  the 
condition  for  the  occurrence  of  a periodic  solution.  An  extended  discus- 
sion of  such  periodic  solutions  can  be  found  in  Bilhajm  (l)j  quantitative 
information  about  the  particular  periodic  solutions  arising  here,  for 
vai'ying  values  of  the  parameters,  are  given  in  Flugge-Lotz  (2). 

The  behavior  of  the  solutions  within  the  periodic  solution  is  simple 
enough.  It  is  clear  that,  if  Sp  and  are  the  points  of  tangency  defined 
as  before,  the  segment  consists  of  starting  points*  A solution 

starting  from  such  a point  (in  either  direction)  simply  spirals  out  to  the 
periodic  solution;  and  since  the  totality  of  solutions  obtained  in  this 
way  covers  the  interior  of  the  region  bounded  by  the  periodic  solution, 
•Uiere  are  no  other  solutions  to  consider,  (See  the  picture  below.) 


Thus  all  solutions  spiral  onto  the  periodic  solution  and  this,  from 
the  point  of  view  we  have  adopted,  is  also  unfavorable.  It  may  be  seen  that 
in  this  case,  as  in  Case  I,  we  have  "structural  stability  with  respect  to 
time  lags";  i.e.,  the  presence  of  a small  time  lag  would  not  change  things 
essentially. 

Case  IV.  In  this  case  the  switching  line  is  as  in  Case  I,  the  arcs 
of  solutions  as  in  Case  III.  It  may  be  shown  that  no  periodic  solution 
can  exist  in  this  case;  in  fact,  every  arc  with  its  ends  on  the  switching 


line  has  Its  teraln&l  point  nearer  the  origin  than  its  initial  point,  and 
the  condition  for  a periodic  solution  cannot  be  satisfied. 


CASE  IV  WITHOUT  TIME  LAG 


In  this  case  the  segment  SpS^  consists  of  end  points,  and  by  tracing 
the  solutions  which  end  on  it  backwards  one  can  see  that  these  cower  the 
entire  plane;  thus  in  this  case  all  solutions  end  on  the  segment  SpSjj  of 
the  switching  line. 

But  here  again,  as  in  Case  II,  the  presence  of  a time  lag  makes  a 
difference.  The  time  lag  makes  no  difference  of  importance  until  the  so- 
lution in  question  reaches  SpSj^;  then,  instead  of  ending,  the  solution 
proceeds  for  some  small  distance  beyond  the  switching  line,  has  a comer, 
recrosses  the  switching  line,  has  another  comer,  and  so  on.  It  may  be 
seen  that  the  successive  points  of  crossing  obtained  in  this  way  have  the 
pmperty  ihau  each  is  closer  to  the  origin  than  its  predecessor,  until  one 
of  them  1:.03  on  uhe  other  side  of  the  origin.  After  this  has  happened  the 
solution  oscillates  around  the  origin  in  a more  or  less  irregular  way,  but 
with  a high  mean  frequency  and  small  mean  amplitude.  This  is  the  most 
favorable  of  the  possibilities  so  far  considered,  for  every  solution  moves 
into  the  orrLgin  with  the  passing  of  time,  and  this  irrespective  of  the  mag- 
nitude of  the  cons  tents  b , K , and  M . But  the  manner  in  which  it  does 
so  is  imsatisfactcx-y,  for  it  involves  a rapid  fluctuation  in  the  sign  of  ^ 
("chattering")  which,  in  general,  continues  indefinitely. 
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Thus,  at  best,  linear  switching  leads  to  solution  behavior  which,  both 
qualitatively  and  (as  we  shall  see)  quantitatively,  is  far  from  perfect;  it 
will  be  shown  later  (Theorem  7)  that  all  its  defects  can  be  avoided  by  taking 
a different  kind  of  . 


Ill,  THE  MINIMAL  THEORY;  GENERAL  CONSIDERkTIONS 

We  now  return  to  the  problem  stated  at  the  bottom  of  page  3.  If  the 
second  order  equation  (3)  is  replaced  by  the  equivalent  first-order  system 
(U),  the  problem  may  be  described  as  that  of  finding  a function  (x,y) 
such  that  for  any  point  p in  the  x,y-plane  the  solution  from  p of 

^ = y » ^ = «^(x,y)  '■  g(x,y)  = -1,  g £ C^]  (u) 

has  the  following  properties: 

(i)  it  passes  through  the  origin  (x  - y = 0); 

(ii)  the  length  of  time  necessary  to  move  from  p to  the  origin  along 

the  solution  from  p is  minimal  with  respect  to  <p  ; i.e,,  no  other 

could  make  this  time  shorter. 

(The  parametrization  of  a solution  in  terms  of  t is  naturally  in- 
duced in  the  obvious  way  by  the  known  parametrization  in  terms  of  t of 
its  component  arcs;  thus  the  "time"  required  to  move  from  one  point  on  a 
solution  to  another  which  follows  it  is  a well-defined  quantity.) 


The  object  of  this  section  is  to  study  the  problem  in  this  general 
form.  It  will  first  be  shown  how  the  problem  is  equivalent  to  one  of  a con- 
ceptually simpler  character,  and  then  (in  Theorem  1)  it  will  be  shown  that 
this  problem  in  turn  can  be  greatly  simplified. 

It  has  been  pointed  out  that  a solution  of  (U)  consists  of  a sequence 
of  alternating  F-  and  N-arcs,  the  initial  point  of  the  first  being  the  ini- 
tial point  of  the  solution,  and  the  terminal  point  of  each  being  the  initial 
point  of  the  next.  Our  point  of  view  will  be  to  consider,  for  an  arbitrary 
point  p in  the  plane,  the  class  of  all  curves  of  this  kind  which  begin  at 
p and  pass  through  the  origin;  and  our  purpose  will  be  to  find  in  this 
class  a curve  along  which  the  time  necessary  to  reach  the  origin  is  shortest. 
Thus  we  make  the  following  definitions  (the  function  g(x,y, ) , and  there- 
fore the  P-  and  N-systems,  being  fixed); 

A path  from  the  point  p is  a finite  or  countable,  well-ordered  se- 
quence of  alternating  P-  and  N-arcs  such  that: 

-JJ- 

1)  The  sum  of  the  lengths  of  the  arcs  is  finite  (=  t ). 

2}  The  initial  point  of  the  first  arc  is  p . 

3)  The  terminal  point  of  each  arc  is  the  initial  point  of  the  next. 

h)  If  there  are  finitely  many  arcs,  the  terminal  point  of  the  last 
arc  is  the  origin;  if  there  are  infinitely  many,  then 

x(t)  0 and  y(t)  -^0  as  t t , 

the  curve  composed  of  the  arcs  being  parametrized  in  the  obvious  way  in 
terms  of  the  parametrization  of  the  component  arcs. 

5)  No  two  of  the  arcs  intersect. 

(In  order  to  avoid  a conflict  between  3)  and  5),  we  regard  each  arc 
as  containing  its  initial  point  but  not  its  terminal  point;  this  convention 
will  have  no  effect  on  the  time-length  characteristics  of  paths,  and  there- 
fore does  not  really  restrict  the  generality  of  what  follows,) 


"Length, " "longer, " and  similar  expressions  should  be  understood  to  refer 
to  time,  not  geometric  length,  here  and  throughout  what  follows. 


A path  from  p can  therefore  almost  be  described  as  a curve  which 
could  occur  as  that  part  of  a solution  from  p (for  some  </>  ) which  con- 
nects p with  the  origin.  "Almost,”  because  5)  need  not  hold  for  a solu- 
tion of  (ii).  This  point  will  be  cleared  up  presently. 

A path  from  p which  is  not  longer  than  any  other  path  from  p will 
be  called  a miniitial  path  from  p . 

In  order  to  solve  the  problem  stated  on  page  17  ^ ^ sufficient  to 
find  a unique  minima 1 path  from  each  point  p . 

Namely,  one  needs  only  to  define  <p  (x,y)  = »-l  on  P-arcs  which  oc- 
cur in  the  minimal  paths,  and  (/>  (x,y)  = -1  on  the  N-arcs  which  occur  in 
the  minimal  paths.  ('^(0,0)  is  to  be  left  undefined,  or  it  can  be  given 
either  value.)  Such  a <p  (x,y)  automatically  yields  the  minimal  paths  as 
solutions,  and  the  minimal  path  from  a point -is,  by  definition,  the  shortest 
possible  solution  connecting  p with  the  origin.  Two  things  must  be  veri- 
fied: (1)  that  this  method  defines  (x^y)  uniquely  at  every  point  except 
the  origin,  and  (P)  that  notning  is  lost  by  leaving  out  of  consideration 
those  possible  solutions  for  wnich  5)  fails. 

To  verify  (1),  observe  first  that  every  point  p must  lie  on  at  least 

one  minimal  path,  namely  the  minimal  path  which  begins  at  p . Thus  (x, y) 

is  defined  everywhere.  If  there  were  some  point  at  waich  it  failed  to  be 

unique,  then  this  point  p would  need  to  lie  both  on  an  N-arc  belonging  to 

one  minimal  path  A (from  the  point  a ) and  on  a P-arc  belonging  to  an- 

a I; 

other  minimal  path  A ^ (from  b ),  Denote  those  parts  of  A^  and  A^  w-la. 
lie  between  p and  the  origin  by  A'^  and  A'^  respectively;  then  their 
time  lengths  t(A'  ) and  t (A'  ) stand  in  .some  relation  to  each  other, 
say  T (A-^)  ^ t(A'^^;.  Then  ^ ~ A‘^  + ^ ^ • A^  " A'^  + A^ 

may  not  be  a true  path  (for  it  may  cross  Itself),  but  a true  path  A^^  may 
be  obtained  from  it  by  cutting  out  whatever  closed  loops  or  retracings  it 
may  containj  and  obviousI.y  ^ ^ '*'(Aj^),  which  contradicts  the  assump- 

tion that  A^  was  the  unique  minimal  path  from  b . 

To  check  (2) f it  will  suffice  to  show  that  any  curve  A which  might 
occur  as  a solution  connecting  p with  the  origin  and  failing  to  satisfy 
5)  can  be  replaced  by  another  such  curve,  at  least  as  short,  which  satisfies 
5)  and  is  therefore  a path.  Let  A be  as  descrxbed,  and  let  A'  be  the 
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path  obtained  from  A by  cutting  off  all  of  A beyond  the  first  intersec- 
tion of  this  curve  with  the  origin.  A'  contains  the  origin  just  once,  and 
ends  there;  since  A is  not  longer  than  A , it  is  enough  to  show  that  A' 
satisfies  5).  Suppose  the  contrary;  if  A-  intersects  itself  at  a point 
q beyond  which,  by  page  7,  It  is  therefore  periodic,  it  follows  that  A' 
contains  the  origin  at  least  twices  once  folj.owing  the  second  passage 
through  q (for  A'  ends  at  the  origin)  and  therefore  on  the  period  con- 
necting the  first  passage  of  A-  through  q with  the  second.  This  is  a 
contradiction. 

The  problem  with  which  we  shall  actually  be  concerned  is  therefore 
that  of  finding  a unique  minima]  path  from  any  point  p , given  the  func- 
tion g(x,y)  . 

A path  will  be  called  canonical  if  it  contains  no  NP-corners  (see 
p.  7)  above  the  x-axis  and  no  FN-corners  below. 

‘ Theorem  1,  Given  any  path  A from  p which  is  not  canonical,  one 
can  find  ^ canonical  path  from  p which  is  shorter  (in  terms  of  time) 
than  A . 

Proof.  (In  saying  that  a corner  lies  above  or  be],ow  the  x-axis,  we 
mean  that  the  arcs  adjacent  to  the  corner  are,  for  ''values  of  t suffi- 
ciently near  the  value  corresponding  ■‘■o.  the  corner  itseif,  above  or  bvolow 
the  x-axis  respectively;  the  corner  its-^lf,  -regarded  as  a point,  may  thus 
lie  on  the  axis.) 

The  idea  of  the  proof  is  simp‘'e;  given,  ..say,  a path  with  the  NP - 
corner  p ■ aba'c  the  x-axis,  on^  denotes  by  p-  either  the  last  corner  of 
the  path  preceding  p or  the  .last  intersection  preceding  p of  the  path 

with  the  x-axis  (whioi.'eve''  i.s  nearer  p ), 
and  denotes  by  p”  the  corresponding  point 
with  "fo.l lowing"  in  place  of  "preceding"  and 
"first"  In  place  of  "last,"  One  then  draw.s 
the  P cvirv^-  fcnifard  from  p*  and  the  N-curve 
backward  from  p"  , thereby  obtaining  a four- 
sided  figure  aj  shown.  If  one  now  modifies 
the  given  path  by  replacing  p‘pp"  by  p'p"'p", 


the  NP“Corner  p is  removed,  no  other  such  comer  is  introduced,  and  the 
path  is  shortened.  To  see  this  last  fact,  note  that,  by  (ii), 

t(p>pp«)  = J*  t(p'p'»p")  = J ^ 

p'pp"  p'p"'p" 

(These  two  integrals  must  ccmverge,  for  the  quantities  T(p’pp")  and 
t(p'p"'p")  are  obviously  finite. J However,  y is  greater  (for  a given 
value  of  X ) on  p'p'"p"  than  on  p'pp"  j therefore  the  second  integral 
is  smaller  than  the  first,  as  was  claimed.  Thus  if  one  applies  this  pro- 
cess to  every  NP-corner  above  the  x-axis,  and  tne  corresponding  process 
to  every  PN-corner  below  the  axis,  one  obtains  a canonical  path  shorter 
than  the  given  one. 

Two  things  must  be  proved:  (1)  that  it  is  always  possible  to  con- 
struct the  "quadrilateral"  of  the  type  shown}  and  (2)  that  the  process 
described  does  not  produce  any  self-crossings,  so  that  a true  path  is  in 
fact  obtained. 

Let;  p , p'  1 and  p"  be  as  described  above;  if  the  initial  point 
of  a path  is  regarded  as  a corner,  p'  always  exists,  and  p"  always 
exists  since  the  path  goes  to  the  origin'.  It  will  be  shown  first  that  .. 

II  .. 

the  P-semicurve  FI  beginning  at  p'  passes  over  p'pp"  ^d  crosses  the 
vertical  line  through  p"  , That  D moves  to  the  right  as  long  as  it 
remains  above  the  axis  follows  from  the  first  equation  in  (U).  Suppose 
that  n is  parametrized  by  t in  such  a way  that  t = 0 gives  p ' ; 
then  n has  one  of  the  following  two  properties; 

(i)  n goes  arbitrarily  close  to  the  x- axis  as  t -^oo  (i.e.,  it 
either  crosses  the  x-axis  for  some  t > 0 or  lin^  inf  y(t)  [on  D]  = 0 . 

(ii)  n goes  to  infinity  in  the  sense  that 

sup  ^xl  (x,y)  < n}  = +«> 

For  assume  that  (i)  is  false;  then  there  exists  a number  e > 0 

and  a value  t > 0 of  t such  that  for  t > t , y(t)  [on  E]  > < , 
dx 

Since  ~ ^ ^ this  gives 
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x(t)  = J y(  ^ )d'*'  ^ ^ ^ “ *q)  ’■►00  as 't 


CO 


so  that  (ii)  holds. 

Since,  as  t increases,  II  moves  steadily  to  the  right,  it  is  clear 
what  n must  do;  it  must  either  move  off  to  infinity  as  in  (ii),  or  cross 
the  x-axis  at-some  point,  or  tend'^to  some  point  on  the  x-axis  as  t-^oo  , 

n starts  off  from  p'  above  p'pp”  , for  in  the  upper  half  plane 
the  P-curve  through  a point  always  has  a greater  slope  there  than  the  N- 
curve  through  that  point,  and  even  if  p'  lies  on  the  x-axis  (whereupon 
the  two  slopes  are  "equal"  --  both  infinite)  the  radius  of  curvature  of  the 
P-curve  at  -p'  is  greater  than  that  of  the  N~curve.  (All  this  follows  fron 
(ii)j.  in  particular,  the  fact  that  we  can  talk  about  radii  of  curvature  fol- 
lows:frbm  the  fact  that  g(x,y)  t , so  that  x(t)  and  y(t)  — the  func- 
tions defining  fl  — have  continuous  second  derivatives.)  H cannot  cross 
the  N-arc  p 'p  , by  what  was  just  said  about  slopes;  it  cannot  cross  the 
P-arc  pp"  , for  P-curves  are  unique;  and  it  cannot  tend  as  t — ► oo  to 
either  of  the  points  p or  p"  (one  or  both  of  which  may  be  on  the  x-axis) 
for  this  would  imply  that  the  point  concerned  would  be  a singular  point  of 
the  P-sys tern,  which  would  in  turn  belie  the  fact  that  both  points  belong  to 
the  ordinary  finite  P-arc  pp"  . Thus  all  that  was  claimed  for  n is  true. 

The  corresponding  argument  can  be  applied  to  N , the  N-semi curve  ob- 
tained by  following  the  N-curve  through  p"  backwards,  and  it  turns  out 
that  it  too  lies  above  p'pp"  and,  in  particular,  crosses  the  vertical  line 
through  p'  . Thus  D and  N must  intersect  at  least  once.  That  they  in- 
tersect only  once  may  be  seen  in  several  ways,  the  simplest  of  which  perhaps 
is  to  observe  that  if  they  intersected  twice  (with  no  other  intersections 
between),  one  of  the  intersections  would  involve  a crossing  with  the  wrong 
inequality  between  the  slopes.  Thus  we  obtain  the  unique  intersection  p"‘ 
and  the  "quadrilateral"  sought. 

This  proves  (1)  on  page  21,  To  prove  (2),  we  note  first  that  in  the 
process  just  discussed  (and  its  complement  for  the  lower  half -plane)  the 
upper  and  lower  half-planes  are  treated  separately,  so  that  in  looking  for 
possible  self-crossings  introduced  by  this  process  we  need  only  consider 
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(say)  what  happens  to  those  parts  of  the  original  path  which  lay  above  the 
x-axis.  Let  and  , then,  be  any  two  parts  of  the  original  path, 

each  contained  between  successive  points  where  the  path  crosses  the  x-axis 
and  in  the  upper  half -plane.  It  is  clear  that  the  process  for  removing 
NP-corners  cannot  introduce  self-crossings  in  either  of  the  separate  pieces 
or  ; hence  it  is  only  necessary  to  show  that  the  process  can  not 

cause  to  cross  . Since  and  do  not  cross,  their  ends 

^1  ’ ^1  ’ ^2  * ^2  X-axis  must  lie  in  one  of  the  following  orders: 

(l)  1 » ^2  ’ ^2  * ^1  ’ ^2  * ^2  * ^1  * ^2  ’ ^1  ’ ^1  * ^2  * 

(U)  ^2  ’ ^2  * ^1  * '^l  * essentially  the  same  as  (li),  and  (2)  as  (3), 

so  we  consider  only  (1)  and  (2), 

Our  process  has  the  property  that  it  leaves  the  points  where  the  given 
path  crosses  the  x-axis  unchanged;  no  such  points  are  removed,  and  none  are 
introduced.  Therefore  the  curves  and  belonging  to  the  final 

path  and  obtained  from  and  by  the  process  have  the  same  ends  on 

the  x-axis  as  before.  Now  in  case  (1)  there  is  nothing  further  to  say,  for 
lies  entirely  over  the  interval  (i  ~ 1,  2)j  since  these  inter- 
vals are  disjoint,  and  cannot  intersect.  In  case  (2),  and 

mus^  beas  shown:  each  H. ' consists 
of  a.P-arc  followed  by  an  N-arc  (one  of 
which  might  be  vacuous).  Suppose  they 
intersected;  say  the  intersection  oc- 
curred on  the  arc  a^c^  . Then,  since 
^1  ^2  ^2  *^1  ^2*^2  intersect  (both  of 

them  being  P-arcs),  ^2^2  inter- 

sect c^b^  ; by  the  previous  argument  about  slopes,  these  two  arcs  can  only 
intersect  once;  therefore  intersect  c^b^  , since  b^  lies  be- 

tween a^  and  b^  . But  this  is  impossible,  for  these  are  both  N-arcs. , 

The  same  line  of  reasoning  applies  if  ^^2  assumed  to  intersect  a^c^. 
Thus  and  are. disjoint,  as  claimed;  and  this  completes  the  proof 

of  Theorem.  1. 

Corollary.  ^ seeking  a minimal  path  from  a point  it  is  only  neces- 
sary to  consider  canonical  paths  from  that  point. 


For  it  follows  directly  from  Theorem  1 that  a path  which  is  minimal 
with  respect  to  the  class  of  all  canonical  paths  is  also  minimal  with  re- 
spect to  the  class  of  all  paths,  this  point  ^ ^ will  therefore  be 

tacitly  assumed  that  all  paths  mentioned  are  canonical. 

If  g(x,y)  has  the  particular  property  that  g(  x, -y)  -g(x,y  ) , 
there  is  more  one  can  say.  For  then,  if  in  the  equations 

^ * y , ^ = ^(x,y)  “ g(x,y)  (U) 

we  make  the  substitutions  ,x  - -X  , y = -Y  , '/>Cx,y)  = -'//(X,Y)  , we  obtain 

g-I,  ■('(X.Y)  - g(X,Y) 

i.e,,  equations  of  exactly  the  form  .fh) . This  means  that  p and  q 
are  two  points  symmetrical  with  respect  to  the  origin,  then  whatever  can 
be  said  about  the  P-  and  N -curves  at  p can  be  said  about  the:  M-  and  P- 
curves  at  q , (E.g,,  if  it. can  be  proved  that  a minimal  path  from  p 
must  begin  with  a P-arc,  it  follows  at  once  that  a minima j.  path  from  q 
must  begin  with  an  h-arc.)  “ 

Since  g(x,y)  aDways  has  the  property  mentioned  when  it  is  linear 
(homogeneous),  ana  g(x,y)  will  be  of  this  type  in  most  of  what  follows, 
this  observation  will  find  extensive  use.  Any  result  obtained  from  an- 
other , by  an  appeal  to  it  will  be  said  to  have  been  obtained  by  symmetry. 

IV.  THE  MINIMAL  THEORY:  g(x,y)  = by 

This  section  begins  the  systematic  study  of  the  prob-em.  discussed 
in  III  for  the  important  case  that  gtx,y)  is  linear.  3y  page  .3,  it  will 
suffice  to  suppose  that  g(x,y)  has  one  of  the  three  forms  by  , x + by  , 
-X  + by  , where  ir.  each  case  b is  an  arbitrary  real  constant.  In  fids 
section  we  study  the  first  of  these;  J.t  arises  from  the  physical  example 
given  at  the  very  outset  and,  of  course,  in  many  other  ways.  The  correct 
solution  of  the  problem  for  b 5 0 has  been  previously  stated  (McDonald 
(1)),  but  without  a convincing  argument. 
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The  character  of  the  P-  and  N-systems  associated  with  the  equations 


^ (x,y)  - by 


depends,  of  course,  on  the  value  of  b j but  this  much  can  be  said:  since 
the  left  members  (disregarding  <p  ) depend  only  on  y , all  the  P-curves 
can  be  obtained  from  any  one  of  them  by  translation  along  the  x-axis.  The 
same,  of  course,  can  be  said  for  the  N-curves,  which  are  in  fact  obtained 
from  the  P-curves  by  reflection  in  the  origin. 

The  P-curves  determined  by  the  origin  and  (for  b ^ 0 ) the  point 
(0,2/b)  for  representative  values  of  b are  sketched  below;  that  the  solu- 
tions have  the  qualitative  properties  involved  can  be  easily  verified. 


When  b < 0 , something  strange  (but  not  unexpected)  occurs:  there 

exist  points  from  which  there  are  no  paths.  More  precisely,  a path  from 

the  point  (x_,y^)  exists  if  and  only  if  ly^l  -b  ^ . For  suppose 
O O — ■ o 

y < b < 0 ; then 
•'o’ 

+ 4 

= -l-  by  <-1-1^0. 

This  means  that  once  any  curve,  P-  or  N-,  is  below  the  line  y = b ^ it 
stays  below;  thus  no  path  from  such  a point  could  cross  this  line,  as  it 
would  need  to  do  to  reach  the  origin.  Similarly  for  any  point  above  the 
line  y = -b  ^ > 0 , That  paths  exist  from  any  point  between  the  two 
lines  will  be  seen  below. 
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When  b ^ 0 , a path  can  be  found  from  any  point-  in  the  plane. 


We  shall  denote  by  F that  part  of  the  P-curve  through  the  origin 
which  lies  below  y = 0 , and  by  F"  its  reflection  in  the  origin.  F~ 
is  therefore  that  part  of  the  N-curve  through  the  origin  which  lies  above 
y = 0 . 


Theorem  2.  (g(x,y)  = by.)  Let  C - F~+  0 + F . This  is  a simple 
curve  which  divides  th-2  plane  into  an  upper  and  a lower  part.  The  unique 
minimal  path  from  any  point  p above  C ( and  below  y ~ ' L b < 0 ) 

is  obtained  by  following  the  H curve  from  p until  it  reaches  0(F)  and 
then  following  F into  the  origin.  If  p lies  below  C (and  above 
y = b ,if  b''0)  the  unique  minima]  path  is  given  by  following  the 
P-curve  from  p to  C(F”)  and  then  following  F into  the  origin.  (The 
solution  of  the  original  prob.l  ::,7,  i -s  to  take  :jl)  ~ -I.  above  C and  on 

F",  +1  below  C and  on  F . ) 


Proof . Note  first  that  if  pq  is  an  N--arc  with  p on  the  x-axis 
and  qr  is  the  F-arc  from  q b?ok  to  the  axis,  then  r(pqr)  , the  total 
time  length  of  this  pair  of  arcs,  is  a m-inotone  increasing  function  of  the 
distance  between  p end  r , or  equivalently  of  the  are-a  bounded  by  pqr 
and  the  segment  pr  of  the  axis;  for  t(pq)  and  r (q*')  ire  both  in- 
creasing fiinctions  of  • y , y being  the  ordinate  of  q ^ but  this  in 
turn  is  such  a function  -of  the  two  quantities  mentioned. 


We  shall  denote  that  path  from  p which,  according  to  the  theorem, 
is  minimal  bv  A „ The  rest  of  the  prf -mf  will  be  broken  up  into  several 

p 

parts. 


A.  ^ p liss  on  the  posit-i ve  half  of  the  x axis,  ^ the 

unique  mi-nimal  path  from  p . 


Let  A be  any  l ath  i’rorr  p ot  her 

p 

no  PN"Corners  below  the  axis  ov  NP  corners 
from  p with  an  N -ar'^  pq  , hnich  maj 
tive  ].eiigtbj  q therefor-e  ties  be]  ow  the 
a P -arc  qr  whie'e  crosses  the  axis.  \ I.r 
have  A ; and  if  r Vav  :r.  be]  ew  'he 
comer  -of  the  •'excluded  t-.-pe.)  Ler  int 


than  A . By  page  2h,  A ■ 

P P 

aueve.  Tt  there.^om  -start.s  out 

(see  below)  assume  to  bo  of  posi- 

axis.  Prom  q the  path  follows 

’’  wore  the  origin,  w 3 would 

X axis  elsewhere,  i*  would  be  a 

•erseetion  -rt  qr  with  the  x-  axis 
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be  V . The  N-arc  of  fron  r , say  rq^  , again  crosses  the  axis  or 

ends  at  the  origin,  for  similar  reasonsj  and  so  on.  By  continuing  this 
kind  of  reasoning,  and  recalling  that  does  not  cross  itself,  one  dis- 

covers that  Ap  must  be  of  the  form  shown.  (To  see  what  happens  when  the 
initial  arc  of  Ap  is  a P-arc,’  think  of  v in  place  of  p .)  In  order 

for  this  path  to  reach  the  origin, 

— ji y 1 sooner  or  later  one  of  the  points  v^ 

0 ^2  (v  pVl  ^2  must  lie  at  or  to  the  left  of  the 

* T T T Tj  1 origin,*  If  p is  that  intersection 

y 0^"  A^  with  the  x-axis  which  immedi- 

<52  ‘ ately  precedes  a for  which  this 

happens,  then  p q v is  an  arc-pair  of  the  type  discussed  at  the  be- 

ri  n n ^ 

ginning  of  the  proof,  and  is  clearly  lower  than  the  NP-comer  of  A^  j 

hence  r(A)>r(pqv)  >t(A^),  which  shows  that  A is  the  unique 
p"  '*^n^n  n p p 

minimal  path  from  p , 

B,  p lies  below  y « 0 and  above  F (and,  if  b > 0 , 
above  y = -b  ) then  the  unique  minimal  path  from  p 1^  ^ p * 

Suppose  that  A were  a path  from  p such  that  r (A  ) $ t (A  ) , 

One  can  join  p to  the  x-axis  by  following  the  N-curve  through  p back- 
wards. If  the  point  on  y = 0 reached  in  this  way  is  p'  , then  pp'  +A^ 
is  (after  the  elimination  of  any  loops,  etc.,  that  it  may  contain)  a cer- 
tain path  (not  necessarily  canonical)  from  p'  , say  A^^  ; then 

^(^p.)  ^ ^(PP'  +^p)  = T(pp')  + t(A^)  < r(ppi)  + t(S^) 

= T(pp>  +^^)  = r(^,) 

* 

and  this  contradicts  the  fact  (from  A)  that  A^,  is  the  unique  minimal 
path  from  p'  , 

C.  ^ p lies  below  y = 0 , ^ the  left  of  F , and,  ^ b 0 , 

■ above  y = - 1 b i , then  the  unique  minimal  path  from  p . 


They  cannot  merely  approach  0 as  a limit  point,  for  this  would  imply 
the  existence  of  infinitely  many  pieces  like  PnQn^n  » each  of  these 
pieces  would  be  longer  than  the  innermost  one,  so  that  the  length  of  the 
entire  path  would  necessarily  be  infinite. 
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Consider  any  path  other  than  2^  from  p j it  starts  put  from  p 
with  an  N-arc  pq  (necessarily  lying  entirely  below  y ° 0 ) and  from  q 
follows  a P-arc  across  the  axisj  but  beyond  this  crossing,  by  the  result 
obtained  from  A by  symmetry,  we  may  suppose  that  the  path  simply  goes  to 

a point  r on  F”  and  then  follows 
\ r~  into  the  origin.  Let  the  inter- 

\ section  of  qr  with  the  axis  be  v j 

\ then  -x^  , and  therefore  r (vrO)  , is 

I 1® a strictly  increasing  function  of 

\ \ T (pq)  ; likewise,  since  ^ 0 

\ the  N-curre  frcrni  p , t (qr)  is 

\ such  a function  of  r(pq)  ; alto- 

Xr  gether  thus  an  increasing 

f\mction  of  'r(pq)  , and  therefore 

takes  on  its  least  value  when  r(pq)  = 0 ; but  this  gives  2 . 

P 

A , B , and  C prove  the  theorem  in  the  lower  half-plane  for  every 
case  but  that  in  which  b > 0 and  p lies  below  y = -b  ^ , The  proof 
in  B bre^s  down  for  this  case  because  the  N-semicurve  ending  at  p 
does  not  reach  the  x-axisj  the  proof  in  C fails  because  > 0 the 
N-semicurve  beginning  at  p , This  bothersome  case  will  be  dispatched 
by  proving  a sequence  of  statements; 


■^y  psth  from  a point  above  F and  below  y = -b 
must  begin  with  an  N-arc  which  reaches  F . 


(b  > 0) 


For  if  a path  Ap  began  with  an 


N-arc  which  fell  short  of  F (if, 
in  particular,  it  were  vacuous), 
the  succeeding  P-arc  could  not 
enter  the  origin,  but  couid  only 
cross  the  axis  to  the  right  of 
0 and  break  off  in  a comer 
above.  The  N-arc  from  this  point 
must  again  cross  the  axis,  but  it, 
as  well  as  any  P-arc  following  it, 
stays  above  the  line  y -b”^  j 
thus  such  a path  could  not  get  to 


the  origin  without  crossing  itself 


Dg*  F is  ^ point  below  the  line  y = -b  ^ (b  > 0),  and 

is  a path  from  p which  starts  along  an  N-arc  pq  , follows  the  P- curve 

from  q to  F"  and  then  follows  F”  into  the  origin,  then  r(A  ) is 

F 

£ strictly  increasing  function  of  r(pq)  . 

Let  the  PN-corner  (on  F~  ) of  A be  r , and  write  X = r(pq)  , 

P 

fi  = r(qr)  , cr  = T(rO)  ; then  it  is  to  be  proved  that  ^ + /i  + o‘)>  0 

The  best  way  to  do  this  seems  to  be  the  following;  it  is  easy  to  verify 
that  the  P-  and  N-systems  are  given  by  the  equations 

x(t)  = A + Be  ^ i t , y(t)  = -bBe"^^  - i , 


where 


the  upper  signs  giving  the  P-curves,  the  lower  giving  the  N-curves.  Using 
these  formulae  and  the  lengths  of  the  three  arcs  of  get  ex- 

pressions for  the  values  of  the  coordinates  of  the  two  corners  of  this 
path,  and  of  the  origin  regarded  as  the  end  of  the  third  arc.  Upon  elimi 
nating  as  much  as  possible  from  this  system  of  equations,  one  obtains 

\ - LL  + cr  = bx  + y = constant 
^ P P 

and  therefore,  upon  differentiating  with  respect  to  X , 


1 + 


da 

dX 


It  is  obvious  that  ^ 0 j therefore  ’’  0 , ana  the  desired  in- 

dX  dX 

equality  follows  at  once. 

Now  it  is  easy  to  prove  that  for  such  a point  is  the  unique 

minimal  path.  In  fact,  if  p is  to  the  right  of  (above)  F , a path 
F from  p must,  by  D^,  begin  with  an  N-arc  which  goes  at  least  far 
enough  to  meet  F ; after  meeting  F the  path  may  be  supposed,  as  in 


the  proof  of  C,  to  be  as  described  in  If  P lies  on  the  left  of  F 

on  the  other  hand,  this  may  be  supposed  at  once.  But  says  that  the 
shortest  such  path  is  obtained  by  taking  the  initial  N-arc  as  short  as 
possible.  When  p is  to  the  right  of  F , this  means  taking  q (the 
first  corner)  on  F j when  to  the  left  of  F , taking  the  initial  N-arc 
vacuous.  In  either  case,  the  path  so  obtaired  is  exactly  ^ 

P 

A , B , C , ana  what  has  just  been  proved  estsblisb  Theorem  2 for 
points  in  the  open  lower  half-plane  and  on  the  positive  half  of  the  x-axisj 
the  rest  follows  by  symmetry. 


V.  THE  MINIMAL  THEORY;  g(x,y)  = x + 2by 

This  case  corresponds  to  the  equation  (2  ) of  page  3,  for  it  the 
equations  (U)  may  be  written 


du 

dt 


V 


u - 2bv 


(6) 


We  must  distinguish  two  subcases:  (i)  Ibl  < 1,  (ii)  Ibi  2 1 , These  two 
subcases  display  essentially  different  kirds  of  qualitative  behavior, 

A.  Ibl  < 1 (THE  SPIRAL  GaSE) 

When  Ibl  < 1 it  is  well  known  that  the  P 5j.-stem  corresponding  to 
(6)  consists  of  spirals  moving  clockwise  around  the  focus  (1,0),  If  b>0 
the  spirals  move  in  towards  the  focus  (which  is  therefore  stable),  while 
if  b < 0 the  spirals  move  outwards,  and  the  focus  is  unstable.  If  b = 0 
the  spirals  degenerate  to  a family  of  circles  and  the  focus  becomes  a 
center.  The  N- system,  may  be  obtainca  by  translating  t.hn  P~system  two  units 
to  the  Left  along  the  x-axis.  It  wil’.  be  convenient  to  apply  a linear 
transformation  to  tne  variables.  Let  a +\/l  , then  the  trar.sf orma- 

tion  is 


X = u + bv  , y =-  av  (7) 

This  transformation  leaves  the  axis  of  abscissae  pointwise  invariant;  in 
fact,  since  it  simply  represents  a change  to  an  oblique  coordinate  system, 
it  leaves  all  the  essential  properties  of  the  original  system,  undistu.ubed: 
PN-ccmers  remain  such,  simple  curves  remain  so,  etc.  Most  important,  the 


time  length  of  a given  path  is  not  changed  by  the  transformation.  It  will 
therefore  suffice  to  consider  the  x,y-plane  only;  any  results  obtained  ther 
may  be  applied  in  the  u,v-plane  by  invoking  the  inverse  of  (7).  Under  (7) 
the  original  system  (6)  becomes 


dX  u a.  L. 

dt  bx  * ay  - b 


|l=-ax-byia 


(S) 


The  solutions  of  (d)  (i.e,,  the  P-  and  N-curves)  are  given  by: 


/,x  “bt/»  iat  . „ -iatv  + , 

x(t)  = e (Ae  + Be  ) - 1 

f.s  . -bty,  iat  T,  -iatv 
y(t)  = le  (Ae  - Be  ) 


(9) 


where 

A = i [x(0)  - iy(0)  ^ l]  , B = A . (10) 

(Here  and  throughout  the  sequel  the  upper  sign  pertains  to  the  P-system, 
the  lower  to  the  N-system. ) The  fvinctions  (9)  represent  ordinary  loga- 
rithmic sprials  or,  if  b = 0 , circles. 

The  main  result  for  the  present  case  is  embodied  in  Theorem  7;  but 
since  this  is  by  far  the  most  involved  case  to  be  considered,  the  proof 
will  be  broken  up  into  a number  of  parts. 

Lemma  1,  The  length  of  a P-  or  N-arc  cut  off  from  the  corresponding 
kind  of  curve  by  two  successive  intersections  with  the  x-axis  is  always 
n/a  , 

Proof,  By  (9)  and  (10),  a given  P-  or  N-curve  intersects  the  x-axis 
when  Ae^*^^  = Be~^°^^  = "Ae^^  or,  otherwise  written,  when  Im  ■^Ae^^^^  = 0. 
Expanding  this  expression  gives 


For  any  but  the  singular  solutions  (the  foci),  the  quantity  on  the  right 


is  well  defined  or  inf inite j and  since  tan  at  taKes  on  every  such  value 
at  intervals  of  n/a  , the  statement  follows. 


Now  a certain  sequence  of  points  on  the  x-axis  is  to  be  defined,  as 
follows;  First,  take  = 0 , If  one  starts  at  ( = (0,0)  and 

follows  the  P-curve  from  that  point  for  the  time  -n/a  one  reaches,  by 
Lemma  1,  a certain  point  ( on  the  x-axis.  If  one  then  follows  the 

N-curve  from  ( £^,0)  for  the  same  length  of  time,  one  reaches  a point 
( Alternately  following  P-  and  N-arcs  of  length  -n/a  in  this 

way,  one  gets  the  sequence  ( which  was  to  be  defined. 


T n c ~ f nbn/a 

Lemma  2.  ^ (® 

Proof,  By  complete  induction, 
the  origin  has  the  equations 


1 


+ 


2 


kbn/a,. 
e ) 


n 1,2, , 


By  (9)  and  (10),  the  P-curve  through 


x(t)  = 1 - e ^^cos  at  , y(t)  = e ^^sin  at 

Putting  t = -n/a  gives  1 + , as  the  formula  requires  (the 

sum  ^ being  of  course  vacuous  in  this  case.)  Now  let  it  be  assumed 

that  the  formula  holds  for  n = m j it  will  be  proved  for  n-m+l. 

( ^ ^T»0)  is  the  end-point  of  an  arc  whose  initial  point  is  ( C >0)»  s-ad 

niT'-L  HI 

which  is  a P-  or  N-arc  according  as  m is  even  or  odd.  Thus  the  equa- 
tion of  the  arc  is 


x(t)  = 


-bt/,  iat 
e (Ae 


_ -iat. 
Be  ) 


(-1) 


m 


where 

A = B - + 1)  . 

If  one  combine  these  equations  with  t * -n/a  and  the  assumed  value  of 

i , the  result  is 
m 

- x(-nA)  - (-1)"  » 22^/'“’"''“)  ♦ l] 


which  was  to  be  shown. 
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The  letter  F will  be  used'  to  denote  the  P-arc  origin 

with  ( by  the  first  lemma,  the  length  of  F is  n/a  . 

Lemma  3*  If  P € R > where  R the  set  which  consists  of  the  in- 

ternal 0 < X < on  the  x-axis  and  the  Interior  of  the  region  bounded  by 
this  interval  and  F , and  ^ ^ ^ that  path  from  p which  is  obtained 

by  following  tt^  N-curve  through  p ^ F and  then  following  F into 
the  origin,  ti/o  . 

Proof.  It  will  certainly  suffice  to  prove  this  for  points  on  the  axis 
only.  To  get  what  we  want  it  will  be  convenient  to  find  the  coordinates  of 

p and  q by  working  out  from  the  origin.  The 


0 p ^1  equations  of  F in  exponential  form  are 


The  point  q is  given  by  these  equations  when  t has  some  value 
(0  X ' n/a); 


1 bX,  iaX  -iaX.  i b\,  iaX  -iaXx 

--e(e  +e  ) iyq=2®(®  "®  ) 


(12) 


Regarding  q as  the  initial  point  of  the  N-arc  pq  , the  time  necessary  to 


reach  p being  - yj.(0  < /r.  « n/a)  , 

* 1 

y > 0 - 

P 


one  likewise  obtains: 

f 

(13) 

S 


where  A = i(x  +l-iy),  B=A, 

^ q •'q" 

The  equation  (13)  implies  that  Ae  is  real.  Putting  its  imagi- 

nary part  equal  to  zero  and  using  (12)  one  gets 

e^^sin  a(X  + ^ ) = 2 sin 


Because  0 < a/r.  < n , the  right  side,  and  therefore  the  left  side  of  this 
equation  is  positive.  Tnis,  together  with  the  inequality  0 < a(X  + ^ ) < 2n  , 
implies  0 < a(X  + ) < n , which  was  to  be  shown. 


R-U69 

-3k  ~ 


% 


f 

^r 

5 

I 

I 

1 

i 
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With  the  aid  of  these  three  little  lemmas  it  is  possible  to  start 
finding  minimal  paths.  As  in  the  proof  of  Theorem  2,  the  procedure  will  be 
to  examine  one  part  of  the  lower  half -plane  after  another  until  a unique 
minimal  path  from  every  point  in  this  region  has  been  found;  the  rest  will 
follow  by  symmetry. 

One  more  definition;  A P-path  is  a path  which  begins  with  a P-arc, 
an  N-path  with  an  N-aro. 

Theorem  3.  ^ p ^ ^ point  on  the  Interval  0 < x < 1 (y  = 0), 

then  given  Oiiy  P-path  ^ from  p which  is  of  length  < n/a  one  can  find 
an  N-path  from  p which  is  shorter  than  ^ , 

Proof . ^ must  be  of  the  following  type:  it  begins  (by  asw'umption) 
with  a P-arc  pq  . This  cannot  return  to  the  x-axis,  for  if  it  did  it  alone 

(and  therefore  the  whole  path)  would  have  a length  3 n/a  , by  Lemma  1;  and 

this  has  been  precluded.  From  q , A follows  some  N-arc  qr  which  crosses 
the  x-axis  but,  for  the  same  reason  as  before,  does  not  cross  it  again.  (That 
it  does  cross  the  x-axis  once  may  be  seen  as  follows;  it  cannot  stop  short  of 
the  x-axis,  or  on  the  interval  0<x<l,for  this  wouHd  give  a corner  of 
the  wrong  kind;  and  if  it  stopped  on  the  axis  to  the  right  of  the  point  (1,0), 
the  succeeding  P-arc,  by  the  corner  condition,  would  necessarily  return  to 
the  axis  and  therefore  have  a length  > n/a  .)  The  N-arc  qr  is  followed  by 
a P-arc  which  must  likewise  cross  the  axis  at  some  point  s . The  point  s 
lies  to  the  left  of  p , for  otherwise  A could  not  reach  the  origin  frcm  s 

without  crossing  itself.  What  A may  be  like  beyond  s does  not  natter. 


If  X = ^(pq)  , H-  ~ (q^*)  » ^ . then  in  terms  of  these 

variables  one  can,  by  repeatedly  applying  (9)  and  (10),  get  the  coordinates 
of  the  points  q , r , s ; and  from  the  resulting  expressions  one  obtain?, 
upon  elimination, 


, -yo- 


([(Xj,  - 2}  , y = b 


ai 


Ok) 


Our  object  will  be  to  show  that,  by  holaing 
p and  s fixed  arri.  reducing  X to  0 , 
we  obtain  a path  which  is  shorter  than  A , 
For  this  purpose  we  shall  prove 

(X  + + o-  ) > 0 
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Since  p and  s are  to  be  held  fixed,  X can  be  taken  as  the  sole  inde- 
pendent variable  in  (li;)i  and  differentiating  both  sides  of  (IR)  with  respect 
to  X leads  to 


d 

dX 


(X  + ft  ♦ O'  ) 


yo"  dcr  _ 

(x  - l)e~  + 2 
P 


(15) 


(The  denominator  does  not  vanish;  for  when  b / 0 the  fact  that  0 < X < n/a 
implies  that  it  cannot  even  be  real;  and  when  b = 0 it  is  the  sum  of  two 
positive  terms.)  This  derivative  is,  of  course,  real;  putting  its  imaginary- 
part  equal  to  zero  gives 


^ [~2  sin  aft  + (x^  - l)e  ^^sin  a(X  -J-  ft  ) ] = (l-Xp)e  ^^sinaX  . (l6) 

One  can  use  this  to  eliminate  from  the  right  member  of  (15)  or,  what 

is  the  same,  from  its  real  part.  The  result  so  obtained  is 

2 sin  a/t[2  Mx  - l)e*'"^]  2 

^(X  + ft  +cr  ) = TkI 

1 2 + (Xp  - l)e  ^ 1 j^2  sin  aft  + (x^  - l)e  sin  a(X  +M)] 


It  follows  from  (16)  that  the  second  term  in  the  aoove  denominator  is  posi- 
tive; thus  the  derivative  has  the  sign  of  sin  aft  . When  ft  has  its  origi- 
nal value,  this  is  positive,  since  0<X+ft  + a < n/a  . So  as  X is  de- 
creased the  whole  sum  X + fi  + <r  decreases,  /t  remains  less  than  n/a  , 
and  the  derivative  remains  positive.  Thus  we  may  shorten  ^ by  aecreasing 
X ; and  x may  be  decreased  without  changing  the  topology  of  the  situation 
until  one  of  two  things  happens: 

I.  r comes  into  coincidence  with  s . But  this  is  impossible,  for 
if  it  did  occur  the  shortened  path  would  then  contain  an  N-arc  qs  which, 
since  it  intersects  the  x-axis  twice,  is  of  length  5 n/a  , This  would  con- 
tradict the  assumption  that  t (A)  < n/a  , 

II,  q comes  into  coincidence  with  p ; that  is,  X goes  all  the  way 
to  zero.  This  is  Just  what  was  intended. 


Corollary  3»1»  ^ P £ R (see  Lemma  3),  the  conclualon  of  Theorem  3 


remains  true. 


Proof . It  is  easy  to  see,  by  arguments  like  those  used  above,  that  a 

sufficiently  short  P-path  from  p must  be 
of  the  type  pp  < qrs . . . shown  in  the  ad- 
joining sketch.  By  the  proof  of  the  theorem, 
one  can  show  that  decreasing  > - v(p'q) 
shortens  the  whole  path.  This  process  may 
be  continued  until  one  of  two  things  happens: 
1 (described  above)  --  which  is  impossible  for  the  same  reason  as  before  — 
or: 


II'.  The  arc  qr  comes  to  contain  the  point  p . But  in  this  case 
we  get  a path  which  is  shorter  than  the  original  and  consists  of  the  closed 
loop  pp'q’p  and  an  N-path  from  p . By  simply  eliminating  the  loop  we  get 
a still  shorter  path  of  the  kind  sought. 

Corollary  3.2.  IT  p £ R , path  A from  p such  that  t(^)  < n/a 
vhlch  does  not  begin  with  an  N-arc  intersecting  F may  be  replaced  by  a 
shorter  one  which  does. 

Proof . Let  q be  the  initial  point  of  the  first  P-ai'c  of  A , 3y 
assumption,  q £ R , By  applying  Corollary  3.1  to  q , we  may  shorten  A 
by  replacing  q...  with  a path  from  q which  begins  with  a non-vacuous 
N-arc  qq^  . If  q^  ^ R , there  is  nothing  more  to  do;  if  q^£  R , the 
procedure  is  to  be  repeated.  Sooner  or  later  the  point  q^^  , which  is  the 
initial  point  of  the  first.  P-arc  of  the  path  after  n such  modifications, 
must  lie  on  or  below  F j for  each  point  q is  like  r'  (see  the  last 
sketch),  and  if  all  the  points  q^^  lay  in  R , then  all  the  corre.sponding 
points  like  s on  A would  lie  to  the  right  of  the  origin,  which  A could 
therefore  never  have  reached,  (It  should  be  observed  that  infinitely  many 
points  q^  cannot  occur,  for  if  they  did  this  would  imply  the  existence  of 
infinitely  many  pieces  like  p-qrs  in  A ^ each  outside  the  preceding  one; 
and  this,  as  it  is  not  difficult  to  prove,  would  imply  that  the  length  of 
the  whole  path  A was  infinite. 

Theorem  U-  If  p £ R , the  unique  minimal  path  from  p ^ obtained 
^ following  the  N-curve  through  p F ^ then  foll.owlng  F into 
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the  origin*  p 6 F , the  unique  minimal  path  from  p ^ obtained  simply 

by  following  F into  the  origin. ) 

f 


Proof.  By  Lemma  3*  we  know  that  there  exists  at  least  one  path  from 
p the  length  of  which  is  less  than  n/a  { thus  we  need  consider  only  paths 
satisfying  this  inequality.  Then,  in  view  of  Corollary  3.2,  we  can  further 
restrict  our  attention  to  those  paths  from  p 6 R which  follow  the  N-curve 
from  p at  least  until  it  reaches  F , We  shall  assume  that  p lies  on  the 
x-axis;  for  if  the  theorem  is  proved  for  this  case,  it  will  automatically 
follow  for  any  point  on  the  N-arc  connecting  p with  F , and  the  totality 
of  such  points  is  (as  p ranges  over  the  interval)  R , 

Let  A be  such  a path.  It  begins  with  an  N-arc  pq  which  crosses  F 
but  does  not  return  to  the  axis,  for  this  would  make  A too  long.  The  cor- 
ner q is  followed  by  a P-arc  qr  which  crosses  the  axis  once  (it  cannot 
stop  there  — unless  q = 0 — for  this  would  force  the  succeeding  N-arc  to 
return  to  the  axis,  and  this  would  again  make  A too  long)  but,  for  the 
same  reason,  does  not  return  to  it.  The  N-arc  of  A fron  r intersects 
the  axis  at  some  point  s j what  A may  do  beyond  s will  be  irrelevant. 


q 


The  rest  of  the  proof  follows  that  of 
Theorem  3 almost  to  the  letter.  If  one  puts 
\ = r(pq)  , fj.  = r(qr)  , and  a = r(rs) 
and  computes,  using  (9) -(10)  repeatedly,  the 
coordinates  of  q,  r , and  s in  temis  of 
\ , /X  , o-  , and  the  coordinates  of  p , one 
gets,  after  a little  manipulation 


Xg  + 1 = e"^®"  |[(Xp  + l)e'^^-  2]  e"^^  + 2 j-  ( y = b - ci)  (17) 


Differentiating  with  respect  to  X , one  then 


2 


OlK 

- (x  + l)e 
P 


(la) 


which  corresponds  to  (lU). 
gets 


(As  before,  the  denominator  does  not  vanish. ) Again  separating 
imaginary  parts,  putting  the  latter  equal  to  0,  and  eliminating 
to; 


real  and 

lead 


(Xp  + l)e  ^^sin  a(\  + ^ )]  = (Xp  + l)e  *^^san  aX  (1?) 

2 sin  a/i  ["(x  + l)e  - 2l  ^ 

(20) 

(Xp  + l)e  ^ I jj2  sin  a/x  - (Xp  !-l)e  sin  a(\ +/i)j 

As  in  the  earlier  proof,  (19)  shows  that  the  second  factor  of  the  denomina- 
tor in  (20)  is  positive;  therefore  (20)  implies  that  the  derivative  has  the 
same  sign  as  sin  a/i  . This  quantity,  however,  is  positive,  for 
0<ft<  X + M + <7<  n/a  , Therefore,  as  X is  decreased,  so  is  X +/i  + cr 

(i.e.,  the  path  is  shortened)  and  sin  ap  remains  positive.  If  X is  de- 

creased until  q lies  on  F , so  that  qr  contains  the  origin,  and  if  all 
the  shortened  path  beyond  0 is  cut  off,  a still  shorter  path  from  p is 
obtained,  and  it  is  exactly  the  path  whose  minimality  was  to  be  proved. 

This  completes  the  proof. 

Theorem  h really  constitutes  the  first  step  in  an  inductive  argument, 
the  whole  of  which  will  give  the  ma'n  result. 

Theorem  5.  ^ P £ point  on  the  interval  | {^1  < x ^ ^^n+1'  ’ 

then  the  unique  minimal  path  from  p that  which  consists  of  n + 2 

arcs,  the  first  of  which  is  an  K-arc  of  lengt>i  X (0  $ X < n/a),  the  last 

of  which  is  of  length  <r  {0  < <r  $ n/a),  and  the  .'.■'it  erven  lug  ones  of  which 
are  all  of  length  n/a  , (It  wili  be  shown  ohau  a path  of  this  kind  really 
exists  and  is  unique.) 

Proof ■ The  proof  will  be  by  induction  on  n , n ~ 0,  1,  2, . . . . 

Theorem  gives  the  desired  result  for  n = 0 , where  the  interval  is 
( ij)  ~ ^1^  * this  case,  of  course,  the  intervening  arcs  of 

length  n/a  do  not  occur. 

The  next  step  in  the  proof  will  be  to  determine  the  locus  of  the  cor- 
ners belonging  to  the  paths  described  in  the  statement  of  the  theorem.  This 
will  serve  us  in  several  ways.  It  is  clear  that  the  corners  must  be  ob- 
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tainable  in  the  following  way:  one  starts  at  the  origin  and  moves  along  f 
(or  , the  reflection  of  F in  the  origin)  for  some  interval  of  time 
- O' (0  < O'  ^ n/a),  then  turns  onto  an  N-(P-)  arc  and  foDlows  it  fr-r  a time 
-n/a  , thence  follows  the  succeeding  P-(N-)  arc  for  the  same  length  of  time, 

+ K 

and  continues  to  follow  alternating  P-  and  N*-arcs  until  the  n " such  has 
been  traversed.  The  ends  of  these  n^*^  arcs,  as  a ranges  over  its  inter- 
val, describe  a certain  curve  E , wnicn,  I claim,  is  thus  the  locus  of 
the  first  corners' on  the  paths  described  in  the  theorem,  it  is  on^-y  neces- 
sary to  verify  that  this  curve  E is  in  the  right  place.  To  uo  this, 
■parametric  equations  for  will  be  derived.  If  tJie  i;  corner  from  the 

origin  on  the  path  described  above  is  (x  , (cr  ),y  (c  ))  , wiiere  h =■-  0 

nh  nn 

gives  the  corner  on  F or  F~  , then 

x^(o-)  = (-l)^'^^(p^C  + y^h^  ^ ^ (2:1) 

where  n = 0,  1,,..;  h - 0,  l,...,n;  p = ^ 1 - cos  acr  ; 

r)  = -e^*^  sin  acr.  (For  see  Lemma  2.) 

One  proves  (21)  by  induction  on  h , The  number  n is  really  perti- 
nent only  in  that  it  determines  whether  the  path  concerned  begins  m F 
or  on  F~  ; when  n is  even,  it  begins  on  F , and  when  odd  on  F . Thus 


X ( 

no 


) = (-1)^(1 


bcT 


cos  a O' 


). 


O'  ) 


- r 

- t -1/ 


bcr 


•sin  a a 


(Cf.  (11),  page  33.)  In  these  equations  we  have  (21)  for  h - 0 . Now  as- 

siune  that  (21)  holds  for  h = m,  (x  ,)  is  the  end  of  the  arc 

of  length  -n/a  from  (x  ,y  ) and,  as  one  can  easily  /erify,  is  a P--arc 

nm  run  j 

if  n + m is  odd,  an  N-arc  if  n + m is  even.  Thus  (9)  and  (10)  give 


" •P<'’  * 


n+m 


9 y ^-1  ( O'  ) - ip(B  ■■  A) 


where 


I 


(-1) 


n-^-m 


B = A 
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lt one  combines  these  equations  with  (21)  for  h = m , and  uses 
Lemma  2,  one  obtains 

as  claimed. 

In  particular. 


and  these  are  the  equations  of  . Furthermore, 


X (0)  = 1C  I , X (a/a)  - p^(l  + p)  + !C  I 
nn  n nn  ^m+1 


y (0)  “ y (Jt/a)  - 0 

•'nn  •'nn  ' 


From  this  it  is  easv  to  see  that  E is  a simple,  semi -circi.e- like  curve 
which  lies  below  the  x-axis  and  whose  end-points  are  those  of  the  interval 
i C X ^ ^ ^ n+1^  " ^n  fact,  the  curve  obtained  by  magnifying  T 

by  the  factor  and  then  translating  the  result  to  the  right  for  a dis- 

tance iC^l . Now  it  is  a simple  matter  to  show  the  existence  and  unique- 
ness of  the  paths  in  question.  For  the  existence,  one  needs  only  observe 
that  the  path  obtained  by  going  from  p to  along  an  N-arc  and  then 

following  the  cumre  which,  by  the  above  construction,  defined  the  point  on 
so  attained,  will  suffice.  On  the  other  hand,  any  path  of  the  tjpe 
described  must  intersect  E with  an  N-arc  from  p , and  bevond  this  in- 
tersection  behave  like  the  path  just  described;  but,  for  permissible  values 
of  1 , the  N”curve  from  p intersects  only  once;  and  this  gives  the 

uniqueness. 
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Now  the  proof  proper  can  be  begun.  As  before,  one  sets  cut  by  de- 
ciding how  a path  from  p must  behave.  It  begins,  let  us  say,  with  an  N- 
arc  of  length  X , where  X > 0 . We  shall  momentarily  assume: 

X $ n/a  (23) 

Thus  the  first  arc  of  the  path  ends  at  a point  q on  or  below  the  x-axis, 
and  from  q the  second  arc  qr  , a ?-arc  emerges.  The  arc  qr  must  cross 
the  axis,  but  more  will  be  assumed; 

The  arc  qr  intersects  the  x-axis  in  the  interval  -iC^l  $ x<  - if^  (21;) 

The  two  assumptions  (23)  and-  (21;)  will  be  justified  at  the  end  of  the  proof. 

Since  the  shortest  path  from  p is  being  sought,  we  can  also  assume 

(and  this  requires  no  further  justification)  that  once  the  path  crosses  this 

interval,  say  at  the  point  v , it  coincides  with  the  shortest  path  from  v , 

which  is  given  by  the  inductive  assumption  and  symmetry;  for  if  it  did  not 

it  could  be  replaced  with  one  which  did,  and  which  would  certainly  be  shorter 

than  the  given  one.  For  such  a path,  therefore,  r must  be 

(x  ^(rr  j,y  (cr))  for  some  value  of  cr  which  depends  on  p and  X . 
n^n-x  n^n-i 

In  other  words,  by  (21), 

n— 1^  b r"  „ I I ^ I n— 1 be 

x^  - p (e  cos  aa  - 1)  - I ^ 

If  the  length  of  qr  is  fi  , then  using  (9)  and  (10)  for  the  P-arc  of 
length  - fx  with  the  initial  point  r one  gets: 


where 


A = - 1 - iy^.) 


, E = A , 
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"Then  following  the  N-curve  from  q for  the  time  -X  , one  gets 


bX,_  -ioX  ^ j,  iaXv  t a - r, 

Xp  ^ e (Ce  +06  J - 1 f 0 “ Yp  ■ ie  (Ce  - De  ) 


where 

C * + 1 - iy^)  , D = C . 

The  result  of  combining  a31  these  equations  and  using  Lenuna  2 is 

Xp  + 1 = e^^  [(p”  ^ - £■  p^)e^^  + 2]  (25) 

where,  as  before,  p = and  y =•  b - ai  , Our  problem  is  to  select 

the  shortest  paui  from  those  described,  which  satisfy  (25).  The  length  of 

any  such  path  is  T = \ * fi  ■*  a * (a-l)n/a  , Thus  the  problem  reduces 

to  minimizing  T with  respect  to  (say)  X , x^  being  held  fixed.  The 

range  of  variation  of  X for  a fixed  x is  05X5X  , where  X is 

p o'  o 

that  value  of  X corresponding  to  cr  = n/a  , From  this  point  on  the  ar- 
gument wiil  follow  famiiar  lines.  Tlie  result  of  differentiating  both  sides 
of  (25)  with  x-espect  to  X and  suitably  rearranging  the  terns  is 
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S ■ ® ° , " ■ 

2 (S"  p‘')  - 

The  denominator  does  not  vanish  for  those  values  of  b and  u which  have 
been  admitted.  Upon  separating  the  real  and  imaginary  parts  of  the  right 
member  of  (26),  and  setting  the  latter  equal  to  zero,  one  gets: 


k\  do-  _ 2(lp^)  e 
1^0  P / dX 


sin  gp.  - 0 e 

n-1  bcr  i 
p e sin  a a 


n-1  b(cr  - n ) . . 

e ^ sin  a{  fi  + c } 


Thus  the  derivative  (26)  exists  and  is  continuous  for  0 c t < n/a  , Com- 
bining (26)  and  (27)  yields  the  final  relation 


dT  ^ 2[it(Zp^)^-ij  (Z  p^)  p^  ^ e^^^ 

n-1  \>{<T  + fi  ) . I 

p e sin  ao- 


cos  aa 


^ „2(n-I)^^P<r] 


Sin  a I 


n-1  b(o-  + /i  ) . I ^ 4-1  yo-|2 

e sin  a 0-  <^  ( Z p ! - p e'  j 


It  is  easy  to  see  that  all  the  factors  involved,  except  for  sin  a/x  , are 
necessarily  positive;  therefore  it  is  once  again  true  that  the  derivative 
has  the  same  sign  as  sin  a/x  . We  know  that  sin  a/x  = 0 on  , for 

E is  exactly  the  locus  of  q for  p.  - n/a  . For  X smaller,  i.e., 

“ dT 

for  q above  E^,  sin  a/x  and  therefore  — - are  negative;  for  X larger, 

positive.  Thus  ^ = n/a  gives  T its  minimum  value,  as  was  to  be  proved. 

(For  Xp  = > X = 0 and  p.  = n/a  give  the  only  path  from  p of 

the  type  under  consideration;  thus  it  must  be  miinimal,  and  the  assertion  is 

valid  on  the  entire  interval  l£l<x$l^  ,1.) 

n '‘n+1 

The  proof  will  be  complete  as  soon  as  the  two  assumptions  (23)  and 
(2k)  have  been  justified. 

The  assumption  (23)  will  be  justified  by  showing  that  any  path  ^ 
from  p which  violates  (23)  may  be  replaced  by  a shorter  path  which  sat- 
isfies it.  If  (23)  fails  to  hold  for  A , this  means  that  the  initial 
N-arc  of  A not  only  returns  to  the  x-axis  (on  the  negative  half)  but 
crosses  it.  Let  v be  the  point  at  which  this  happens.  The  claim  is 
that  if  one  "shunts  out"  v by  means  of  a short  P-arc,  a shorter  canonical 
path  is  obtained.  That  such  a shunting  can  be  performed  follows  from  the 


fact  that  at  a point  like  v the  curvature  of  the  P- 
curve  is  definitely  less  than  that  of  the  N-curve| 
this  may  be  inferred  from  the  given  differential  equa- 
tions. If  the  P-arc  introduced  is  short  enough,  it 
cannot  cross  A at  any  points  but  those  indicated,  so 
that  it  does  not  destroy  the  caijonical  nature  of  the 
path.  That  the  path  obtained  in  this  way  satisfies 
(23)  is  obvious;  so  it  only  remains  to  show  that  a 
shortening  is  truly  effected. 

This  will  follow  from  the  resui.T.;  ^ P ~ i ^ tr)  ) , where  { < -i  , 
is  a point  near  the  x-axis,  and  if  k and  X'  are  the  lengths  of  the 
shortest  P-  and  N-arc  respectively  leading  from  p ^ uhe  x-axis,  then 
X < X'  , Namely,  if  the  shunting  F-arc  is  sufficiently  short,  then  both 
a and  b (see  the  above  picture)  will  satisfy  the  requirements  on  p ; 
thus  the  above  proposition  will  imply  that  av'  is  shorter  than  av  , and 
that  v'b  is  shorter  than  vb  . Taken  together,  these  show  that  av'b 
is  shorter  than  avb  , which  was  to  be  proved. 

To  prove  the  result  stated  above,  regard  p as  the  initial  point  of 
the  respective  arcs,  (Suppose,  for  the  moment,  that  p lies  below  the 
x-axis.)  By  using  (9)  and  (10),  and  the  fact  that  the  end  points  of  the 
arcs  are  on  the  x-axis,  one  gets  that  the  quantities 

[(  { - 1)  - ii7  J and  [(  { + 1)  - ±r}] 

are  real.  Setting  their  imaginary  parts  equa]  to  zero  gives 


tan  aX 


-1 

-t  * 1 


tan  aX ' 


■ V 

-T“i 


Since  tan  x is  a monotone  increasing  function  at  points  of  continuity, 
and  since  X and  X'  are  small, 


__13_ 


implies 


X < X' 


as  was  to  be  shown.  If  p lies  above  the  axis,  the  quantities  rj  , X , 


and  X'  change  sign,  but  these  sign  changes  just  cancel  out  to  give  the 
same  conclusion. 


The  assumption  (23)  has  thus  been  justified;  in  what  follows  (the 
justification  of  (2ii))  it  will  accordingly  be  assumed  that  all  paths  con- 
sidered satisfy  this  assumption.  In  defense  of  (2U),  it  will  be  shown  that 
any  path  for  which  (2ii)  fails  must  be  longer  tnan  , the  path  which,  ac- 
cording to  the  theorem,  is  minimal. 


So  let  A be  such  a path.  It  begins  with  an  N-arc  of  length  X 
(0  < X $ n/a)  . If  this  arc  ends  at  the  point  q , then  q is  the  initial 
point  of  a P-arc  which,  by  the  argument  adauced  in  support  of  (23)  and 
symmetry,  may  be  supposed  to  stop  short  of  crossing  the  positive  half  of 
the  x-axis.  If  r is  the  terminal  point  of  this  arc,  there  follows  an 
N-arc  starting  at  r which  goes  at  least  as  far  as  the  positive  half  of 
the  x-axis  (say  at  the  point  s ).  It  may  be  that  r -■=  s . It  follows 
from  the  definition  of  £ that  If  ^1  < x , On  the  other  hand,  since 

A does  not  cross  itself, 


X < X < 

s p 


1C 


n+1' 


What  happens  to  A beyond  s is  not  certain;  but  sooner  or  later,  in 
order  to  reach  the  origin,  A must  cross  one  of  the  two  intervals 


- If 


n 


X < - [{ 


n-1' 


(For  by  arguments  of  the  type  already  given,  A can  cross  neither  the  N- 
arc  joining  (~  lC^i,0)  with  ( P-arc  joining  ( ,0) 

with  (••  ^1,0),  Since  these  two  arcs,  together  with  the  two  intervals 

just  described,  form  a simple  closed  curve  surrounding  the  origin,  the 
remark  made  must  be  true.)  For  momentary  convenience,  let  N(A)  be  the 
number  of  times  A crosses  the  x-axis  after  leaving  p and  before 
crossing  one  of  these  two  inteiT/-als.  (E.g.,  N(A)  =0  would  mean  that 
^ satisfies  (2U).)  We  next  prove;  ^ p ^ the  interval 

1C  I < x $ i C .1 1 , then 
n p ^ n+1  ’ 


R-h69 

-u6-' 


<r(A  ) < 
a ^ p'  a 


(^8) 


According  to  a formula  on  page  h2 


r (^p)  = ^ + cr  + nn/a 


Thus  it  is  only  necessary  to  get  bounds  for  \ + c 
in  (25),  one  obtains: 


Upon  putting  ft  = r./a 


^ 1 ^ e>^  [2  (i/) 


Since  the  imaginary  part  of  the  right  member  vanishes,  we  have 

n b(7  . ,,  V ,.  / 5 k\  . 

p e sin  a(.X  + cr  ) - ^ I X p ) sin  a\ 


But  on  A , 0 ^ X < rt/a  and  0 < X + cr  < i;n/a  : these  facts  and  the  aoovo 

p , 

equation  imply  0 < X + cr  $ n/a  which,  in  turn,  implies  (2o). 

We  can  say  first  that  if  N(A)  5 3 , then  A is  longer  than  A ^ , 

For  when  N(A)  ^ 3 , A c.egir,s  with  a curve  pqrs  as  cescribed  above, 

and  3 is  followea  by  a curve  sq'r's’  of  the  samie  sort.  If  v and  v* 
are  the  points  where  the  arcs  qr  and  q'r'  intersect  the  .x-axis,  one 

can  show  that  each  of  tee  four  pieces  pqv  , vrs  , sq'v'  , and  v'r's' 

has  a length  greater  than  n/2a  ; and  since  the  length  of  that  part,  of  A 
following  s'  is,  by  the  inductive  assumption  and  (28),  greater  than 
(n  -•  l)T(/a  , the  length  of  A itself  is  greater  than  (n  + l)n/a  which, 
by  (28),  is  in  turn  greater  than  ^('^p)  • Thus  A is  too  long. 

So  the  cases  N(A)  - 1 and  N(A)  2 are  the  only  ones  left. 


Consider  pqv  . If  X and  ft  are  the  lengths  of  pq  and  qv  re- 
spectively, then  working  out  the  coordinates  of  v give.s  the  equation 

2 cos  aft  - (x^  + l)e^^  cos  a(X  + ft  J - 1 - x^ 

If  it  were  true  that  X + ft  < n/?a  , tlien  the  left  member  would  be  < t , 
since  cos  a(X  + p.  ) > 0 , while  the  right  member  would  be  > 2,  for 
x^  < " 1 ^^1  < -1  . This  contradiction  proves  the  assertion  for  pqvj 
vrs,  etc.,  are  subject  to  the  same  argument. 


When  N(A)  » 1 , ^ ^ 

nation  of  f , can  be  made  to  jraply: 
n 


ly- 


This,  together  with  the  defi- 


- I£_-1  S 3C  « -|£  i 
n+1  V n 

In  other  words,  v is  a point  like 
p , only  on  the  opposite  side  of 
the  origin.  The  part  of  A lying 
beyond  v provides  a path  from  v 
which  (because  of  the  assumed  posi- 
tion of  s ) is  of  the  type  con- 
sidered in  the  main  part  of  this 
proof.  It  is  therefore  not  shorter 
than  , the  path  given  by  the 
statement  of  the  theorem  and  sym- 
metry as  the  minimal  path  from  v . 
It  will  thus  suffice  to  prove  that 


the  path  A ' obtained  from  A by  replacing  the  part  beyond  v with  A ^ 

is  longer  than  ^ ^ . A minor  modification  of  the  argument  on  pages  Ul,  U2 

ll3  will  do  it.  Here  we  have  exactly  the  same  gecmetrical  situation  as 

there,  except  that  it  is  as  if  the  curve  began  from  the  interval 

l£  -I  < X S IC  J while  in  fact  p is  closer  to  the  origin.  These  ' 
n+X  p n^2 

modlficaticns  do  not  vitiate  the  conclusion  that  ^ being  the  time 

length  of  A'  as  a function  of  X = r(pq))  has  the  sign  of  sin  aji  . 
Here,  however,  sin  afi  > 0 , since  fi  < n/a  . { fj.  - n/a  puts  q on 
E , ; so  if  u 5 > <1  would  lie  above  this  curve  and  force  p to 

lie  in  < x $ dT  ^ ^ ’ and  by  deforming  A'  by 

decreasing  X (and  keeping  the  path  otherwise  of  the  same  type)  one  ob- 
tains a shorter  path  from  p « The  process  may  be  continued  until  the 
path  goes  through  (-  but  as  soon  as  this  happeni.  the  defomed 

path  satisfies  (2i4)  and  this,  as  we  know,  implies  that  it  (and  therefore 
A itself)  is  longer  than  A^  . This  was  the  desired  conclusion. 


If  N(A)  = 2 , 1^  I < X < X and  - If  I ^ x < - J , where 

^ ' * n 3 p n h n-1  ' 

h is  the  next  intersection  after  s of  A with  the  x-axis.  The  part 
of  A after  v , call  it  A , , is  either  (1)  of  the  type  here  considered 

_ _ V 

Satisfying  (23)  but  not  (2U) 


but  with  N(A)  1 1 or  (2)  of  the  type  considered  in  the  main  part  of  the 

proof*^,  according  as  (1)  - ^ \ < ~ “ ^^n+2^  ^ \ 

(As  before,  because  of  the  position  of  s , v can  lie  no  farther  to  the 
left.)  In  either  case,  is  longer  than  A ^ , so  that  in  place  ^ A we 

may  consider  the  path  A'  arising  from  A by  replacing  A^  with  A^  . If 

- |£  1 < X < - IC  1 then  A'  , regarded  as  a path  from  p , satisfies 

n+1  " V n ^ 

iN(A')  = 1 , and  therefore  is  longer  than  A^  . If,  however, 

- \E  !$x  <-iC  tIj  (2d)  the  length  of  A'  , which  is  greater 

than  that  of  A , is  greater  than  (n  + l)n/a  . But,  again  by  (28),  the 
length  of  A is  less  than  this  quantity.  This  is  what  was  to  be  proved. 

^ p 

In  fact,  this  completes  the  whole  proof  of  Theorem  5. 

Corollary.  If  p lies  above  one  of  the  curves  (see  page  I4.O)  and 

below  y = 0 , ^ unique  minimal  path  from  p ^ that  obtained  b^.  following 
the  path  described  in  Theorem  5 which  passes  through  p . 

Proof.  (See  the  proof  of  B,  page  27.) 


The  time  has  come  to  say  something  about  the  sign  of  the  constant  b , 
and  the  effect  it  has  on  the  problem  we  have  been  discussing.  The  minimal 
path  has  been  found  for  every  point  on  0 < x ^ IC^^I  , n = 1,  2,.,.  . Now 
by  Lemma  2,  if  p = 1 , 


^ p"  . 2 . 1 • (p"  - 1) 


Now  when  b > 0 , p > 1 and  therefore  ICJ  00  as  n oo  ^ i.e.,  the 
intervals  l£  1 ^ IC  cover  the  entire  positive  half  of  the  x-axis. 

On  the  other  hand,  when  b < 0 , p < 1 and  we  have  1 "”p 

n CO  , so  that  in  this  case  the  set  of  intervals  < x $ ^ ^n+1^  only 

covers  0 < x < ^ . (When  b = 0 , p = 1 and  1£  1 - 1 + 2(n - 1) + 1 - 2n* 

A.  * p 

From  this  it  follows  that: 

Vfhen  b ^ 0 there  exists  at  least  one  path  from  each  point  3^  ^le 

plane. 

For  the  minimal  paths  from  points  on  the  positive  half  of  the  x-axis 
sweep  out  the  whole  plane;  thus  a path  from  any  point  may  be  obtained  by 
taking  that  part  of  a minimal  path  through  the  point  which  lies  beyond  it. 


* Satisfying  both  (23)  and  (2U) 
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If  b < 0 this  is  no  longer  true.  If  S denotes  the  interior  of 
the  set  bounded  by  the  P-arc  A of  length  n/a  joining  P = ( X"T^  » 
with  -P  = ^ ^ P ,oj  and  the  N-arc  B Joining  the  same  two  points 

(that  these  arcs  exist  may  be  verified  by  making  a straight  substitution 
in  (9)  and  (10)),  then: 


When  b < 0 , there  exists  a path  from  p ^ and  only  if  p 8 S . 


We  first  show  that  there  exists  at  least  one  path  from  each  point 

p e S . Suppose  that  p lies  below  the  curve  C made  up  of  the  arcs 

E 'and  the  arcs  arising  from  these  by  reflection  in  the  origin.  If  the 
n 

P-curve  from  p is  drawn  backwards  (t-^-co),  it  must  intersect  one  of 
the  arcs  E„  , for  it  surely  crosses  the  x-axis  to  the  right  of  the  origin. 


from  p . The  corresponding  device 
part  of  S , by  symmetry. 


and  without  first  crossing  A , which 
is  itself  a P-arc.  Let  the  first  such 
point  of  intersection  be  q . Then 
if  one  follows  the  N-curve  backwards 
(i.e,,  upwards)  from  q , one  reaches 
a certain  point  r on  the  x-axis.  Now 
the  unique  minimal  path  from  r passes 
through  p , and  therefore  that  part  of 
this  path  beyond  p provides  a path 
works  if  p is  in  the  complementary 


To  show  that  no  path  can  exist  from  a point  outside  S , we  shall 
show  that  no  path  can  cross  the  boundary  of  S going  inwards.  Consider 
B ; it  is  an  N-arc,  and  therefore  no  path  can  cross  it  with  an  N-arc.  but 
no  path  can  cross  it  inwards  with  a P-arc,  for  all  P-curves  crossing  B 
cross  it  moving  outwards.  Proof:  By 

^ = -bx  + ay  - b ^ = -ax  - by  - a , (d) 


the  tangent  vector  at  any  point  (x,y)  on  B has  the  components 
(-bx  + ay  - b,  -ax  - by  - a) , Therefore  the  outward  normal  to  E at  this 
point  has  the  components  (ax  + by  + a,  -bx  + ay  - b)  , Also  by  (d),  the 
tangent  vector  to  the  P-curve  at  the  same  point  is  (-bx  + ay  + b,  -ax  - by  + a). 


The  inner  product  of  these  last  two  vectors,  which  has  the  sign  of  the 
projection  of  the  second  on  the  first,  is 

(ox  + by  + a)(-bx  + ay  + b)  + (-bx  + ay  - b)(-ox  - by  + a)  = 2y  . 

Since  y ^ 0 on  B , this  means  that  the  projection  is  positive,  i.e,, 
that  the  P-curve  crosses  E moving  outwards  (except,  of  course,  at  the 
ends  of  E , where  the  two  curves  are  tangent,)  This  proves  the  assertion 
made  about  B $ since  A is  subject  to  the  symmetrical  argument,  the  proof 
is  complete. 

Theorem  6.  ^ p is  £ point  in  the  fourth  quadrant  and  below  the 
curve  made  up  of  the  pieces  (n  = 0,  1, . . . ) , or  in  the  third  quadrant, 

and  ^ a path  from  p exists,  then  the  unique  minimal  path  from  p is 
that  which  follows  the  P-curve  through  p the  (negative  half  of  the) 
x-axis  and  then  proceeds  according  to  the  proposition  arising  from  Theorem 
5 and  symmetry. 

Proof.  Consider  any  path  A from  p which  might  be  minimal.  It 
cannot  cross  any  of  the  curves  and  must  therefore  (having  followed  a 

possibly  vacuous  N-arc  and  then  a possibly  vacuous  P-arc)  cross  the  nega- 
tive half  of  the  axis,  say  first  at  the  point  v , By  Theorem  5,  we  may 
suppose  that  A coincides  with  A^  beyond  v , Now  one  should  imagine 
A extended  backwards  to  the  axis  by  having  adaed  to  it  the  N-arc  preceding 
• p which  connects  p with  the  axis,  say  at  the  point  p’  , The  resulting 
path  A ' is  of  the  type  considered  on  pages  l|.l-li3  , except  for  the  possi- 
bilities that  A'  may  cross  itself  on  the  arc  pp'  and  that  p'  may  not 
lie  in  the  interval  in  which  p was  there  supposed  to  lie.  Both  of  these 
possibilities  prove  irrelevant.  Moreover,  (the  length  of  the  first 

P-arc  qr  of  A ' , which  is  also  that  of  A ) is  less  than  n/a  , by  the 
position  of  q . Therefore,  if  T'  is  the  length  r(A')  and  X is  the 

length  of  p'pq  (the  initial  N-arc  of  A'  ),  T'  is  a function  of  X 

dT 

and  the  argument  of  pages  hl~ii3  implies  ^ ^ » thus  by  decreasing  X 
one  decreases  T'  . (The  derivative  is  actually  discontinuous  when  v 
passes  through  one  of  the  points  (-  IC^I  »C)  during  this  contraction,  but 
T’  itself  is  a continuous  function  of  X .)  If,  in  particular,  one  de- 
creases \ until  q comes  into  coincidence  with  p and  then  removes  p'p 
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from  A'  , one  obtains  what  must  be  the  shortest  path  from  p , and  is  in 
fact  as  claimed. 


The  main  result  for  the  spiral  case  can  now  be  stated  briefly  and  com- 
prehensively, Let  C denote  the  curve  composed  of  the  pieces; 


n/,-,  b(T  . 

p [Z  - e cos  acr  ) 


+ 1 


0 < cr  ^ n/a,  p 


^bn/ a 


r \ Ti 

y^(cr)  = -p 


b(T  . 

e sin  0(7 


n Oj  Ij  Zy  . . • 


and  of  the  pieces  arising  from  these  by  reflection  in  the  origin.  C 
divides  the  set  S from,  which  paths  can  be  drawn  into  an  upper  and  a lower 
part.  ( S is  as  described  on  page  k9  when  b < 0 , the  whole  plane  in  the 
contrary  case . ) 

Theorem  7.  If  p e S and  p ^ above  (resp,  below)  C , the  imique 
minimal  path  from  p obtained  by*'  following  the  N-  (resp.  P-)  curve  from 
p until  it  reaches  C , then  switching  to  the  P~  (resp.  N-)  curve  through 
the  point  of  intersection,  then  following  this  curve  until  it  returns  to  C , 
then  switching  again,  and  so  on.  until  the  origin  is  attained.  (In  other 
terms,  the  unique  minimal  paths  are  obtained  as  solutions  of  (U)  by  taking 
t^(x.y)  = -1  above  C and  on  that  part  of  C to  the  left  of  the  origin 
(within  S)  and  4>  (x,y)  - 1 in  the  rest  of  S .) 

It  should  be  recalled  that  all  this  is  in  terms  of  the  oblique  co- 

ordinates X and  y , (See  page  30.)  The  result  is  also  valid  for  the 
original  u and  v if  C is  defined  as  follows;  Let  be  the  P-arc 

connecting  the  origin  with  the  point  ( {^^,0),  be  the  curve  obtained 

by  magnifying  E^  by  the  factor  p*^  and  translating  the  result 

units  to  the  right  (n  = 1,  2,...),  and  E~  tne  curve  obtained  by  reflecting 
in  the  origin  (n  = 0,  1,  2,,..)j  then  C = E^^  U ( UE^)  . 

B.  Ibi  > 1 (THE  NODE  CASE) 

In  this  case  the  underlying  equation  is  again 


••  2by 


(6) 
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but  now  I b i > 1 . ( The  marginal  case  ! b I " 1 , where  the  singular  points 
aro  so-called  degenerax,©  nodes,  is  more  closely  related  to  this  case  than 
to  the  spiral  case,  but  for  technical  reasons  the  proofs  which  follow  do  not 
happen  to  be  directly  extensible  to  it.  However,  the  same  general  line  of 
reasoning,  appropriately  recast,  can  be  made  to  cover  this  case  too.  Becatise 
the  amount  of  recasting  necessary  is  small,  and  because  the  case  is  of  no 
special  significance,  it  will  not  be  discussed  in  detail.) 

'C'.e  P-  and  N-curves  corresponding  to  (6)  when  Ibl  > 1 are  given  by 
the  equations 


x(t)  - ^ - 1 

y(t)  [(b  - + (b  + ] 


(29) 


>^he^e  p b - 1 and 


^ ^ ^ B = - ^[y(0) + (b-p)(x(0)  ^1)]  (30) 


2P 


The  N-system  consists  of  parabola-like  curves  which  tend,  when  b > 1 , to 
the  point  (-1,0)  as  t — *-oo  and,  when  b « -1  , away  from  itj  this  kind 
of  singular  point  is  called  a (stable  or  unstable)  node.  The  special 
straight-line  solution  which,  with  the  parameter  eliminated,  can  be  written 
y s .(b  + ^i)(x  + 1)  plays  a special  rolei  it  will  be  called  the  ma.jor 
N-separatrix,  Rough  sketches  of  the  N-system  for  the  two  signs  of  b are 
given  beiow-  To  visualize  the  P-system,  one  should  imagine  all  the  curves 
moved  two  luilts  to  the  right  along  y = 0 . 


(The  "degenerate  node"  occurs  when  p = 0 ; in  this  case  the  family 
of  curves  in  the  smaller  sectors  bounded  by  the  linear  curves  collapses  to 
a single  straight  line,  and  all  the  remaining  curves  are  of  the  type  which 
crosses  the  x-axis  once.) 


When  b > 1 there  are  paths  from  each  point  in  the  plane;  this  will 
become  clear  when  the  minimal  paths  are  described  below.  If  b $ -1  , let 
A be  the  P-semicurve  obtained  by  starting  at  (-1,0)  and  letting  t-^-<^ 
and  let  B be  the  symmetrical  N-semicurve.  A and  B together  bound  a 
certain  open  neighborhood  S of  the  origin. 

When  b $ -1  , a path  from  p exists  and  only  if  p C S . 


That  a path  from  p can  be  found  when  p £ S will  be  seen  presently. 

However,  no  path  can  enter  S from  the  outside, 
for  no  path  can  cross  either  A or  B moving 
inwards.  Consider  B } it  is  itself  an  N-arc, 
so  no  path  can  cross  it  with  an  N-arc;  it  will 
thus  suffice  to  prove  that  all  P~curves  crossing 
B do  so  moving  outwards.  (This  will  prove  the 


for  A will  follow  by  symmetry.)  By  (6),  = 


assertion  for  B , and  the  corresponding  assertion 

— ^ ~ ~ . Therefore 

y 

at  any  point  on  B (y  > 0)  the  slope  of  the  P-curve  is  greater  than  that 
of  the  N-curve,  i.e.,  of  B itself;  since  both  curves  move  to  the  right, 
this  means  that  each  P-curve  crosses  B upwards,  as  was  to  be  proved. 

Lemma  Ii.  If  1 ^ Xp  < <»  * path  from  p cm  begin  with  a P-arc. 


Proof.  For  b $-1  , there  exist  no  paths  at  all  from  such  a point. 
When  b ^ 1 , the  statement  is  obvious  for  = 1 , for  then  there  is  no 
P-arc  from  p . For  x^  > 1 , the  P-curve  through  p goes  downwards  and 
to  the  left,  as  can  be  seen  from  (6);  if  a path  from  p began  along  this 
curve,  it  would  be  bound  to  stay  cn  it  until  it  crossed  y = 0 , by  the 
condition  on  corners  (page  21+) ; but  sucii  a cur'';^  never  coes  this  — it 
tends  monotonely  to  the  point  (1,0),  remaining  in  the  lower  half -plane. 


Lemma  5.  Given  My  P-path  (i.e,,  path  r«ginning  with  a nonvacuous 
P-arc)  from  a point  p on  the  Interval  0 < x < 1 , one  can  find  -m 


N-path  from  p vmich  is  shorter. 


Proof,  Such  a P-path  must  behave  as  follows:  the  P-arc  with  which  it 
begins  breaks  off  in  a corner  q aoove  the  axis,  for  the  P-curve  from  p 
does  not  return  to  the  axis  in  finite  time.  The  following  N-arc  crosses 
the  axis  downwards  (in  fact,  within  the  interval  0 < x < 1 ) and,  for  the 
same  reason,  breaks  off  in  a corner  r below  the  axis.  The  succeeding  P- 
arc  extends  back  up  to  the  axis,  which  it  meets  at  a point  s which,  since 
the  path  cannot  cross  itself,  must  lie  to  the  left  of  p j however,  s also 

lies  to  the  right  of  (-1,0).  The  situation  is  thus  as 
sketched.  It  will  be  shown,  as  previously,  that  when 
r (pq)  is  decreased  ( p and  s being  he]d  fixed), 
T(pqrs)  also  decreases,  so  that  in  particular  if  q 
is  brought  down  to  coincide  with  p a path  shorter  than  the  given  one  will 
be  obtained.  This  will  be  the  N-path  from  p whose  existence  was  claimed, 
(This  proof  differs  from  the  corresponding  one  in  the  spiral  case  in  that 
there  is  no  need  here  to  consider  the  possibility  that  r might  come  into 
coincidence  with  s before  q reaches  p ; for  it  is  easy  to  see  that  r 
cannot  come  into  coincidence  with  s at  all.)  If  X = r(pq),  p,  r (qr), 
and  <j  = r(rs),  it  will  thus  suffice  to  show  that 

^ (X  + + cr  ) > 0 


Using  (29)  and  (30)  repeatedly,  one  can  compute  the  coordinates  of  q , r , 
and  s in  the  usual  way  as  functions  of  X , /i.  , cr  , and  the  coordinates 
of  p ; from  these  one  obtains 


where  h - -b  + p , Holding  x^  and  x^  fixed,  differentiating  with  re- 
spect to  X , ana  rearranging  the  result,  one  obtains 


d 

dX 


(X  + /i 


^-h/x 


+ (T  ) = 2 


da 
dX 
hX 


+ 1 


l)e 


(32) 


It  is  clear  from  the  geometry  of  pqrs  that  > 0 j therefore  the  nu- 

merator of  the  right  member  of  (32)  is  positive.  That  the  denominator  is 
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positive  when  h < 0 (b  > 1)  follows  from  x^>  0 , h\  < 0 . When  h > 0 

(b  > -1)  , the  denominator  is  also  positive;  were  it  not,  it  would  follow 

from  (31)  that  x < -1  , and  this  is  impossible, 

3 

We  again  define  F as  that  part  of  the  P-curve  through  the  origin 
which  lies  below  the  x-axis,  and  F”  as  that  part  of  the  N-curve  through 
the  origin  which  lies  above.  F and  F~  are  symmetric  to  each  other  in 
the  origin. 

Lemma  6,  If  p is  above  F and  below  y “ 0 , any  path  from  p 
which  does  not  begin  with  an  N-arc  intersecting  F can  be  replaced  by  a 
shorter  path  from  p which  does. 

Proof.  This  Lemma  follows  from  Lemma  5 in  the  same  way  that  Corollary 
3.2  followed  from  Theorem.  3«  (See  pages  3h-36.) 

Theorem  d.  tne  case  g(x,y)  ~ x + 2by  , Ibt  > i the  unique 
minimal  path  from  a poi.nt  p 6,  S S being  the  set  from  ich  paths  exists 
is  determined  in  the  following  way;  Let  C-F  + 0 + F ; C simple 

curve  which  divides  S into  an  upper  and  a lower  part.  If  p is  in  the 
upper  part,  the  minimal  path  from  p ^ that  obtained  by  following  the  N- 
curve  from  p until  it  intersects  F , and  then  following  F into  the 
origin;  if  p is  in  the  lower  pa.^t,  by  following  the  F-curve  from  p until 
it  intersects  F”  and  then  following  F'  into  the  origin,  (In  terms  of 
(f>{x,y)  , should  be  +1  below  C and  on  F , -1  above  C and  on  F~.) 

C is  sketched  for  positive  and  negative  b below.  When  b > 1,  as 
already  observed,  S is  the  entire  plane. 


b > 1 b < -1 

Proof . Tne  theorem  will  be  prcvea  by  considering  various  possible 
positions  of  the  poitit  p in  tne  lower  half-pi.ane;  the  rest  will  follow  by 
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syminetry.  In  the  picture  which  follows  ; which  is  drawn  for  the  case  h > 1), 
Sjj  and  Sp  denote  the  major  N-  and  P-separatrices  respectively,  and  K 
denotes  the  N-arc  connecting  (1,0)  with  F . When  b < -1  , the  regions 
Rg  , and  much  of  the  follcwing  proof  (which  has 

been  written  to  fit  the  case  b > 1 ) is  irrelevant. 


A,  Suppose  p C , this  set  being  taice.-:  as  closed.  Let  A be 
any  path  from  p . by  i.emrn.i  6,  it  ma>/  be  assumed  that  .A  begins  with  an 
N-arc  pq  which  at  least  reaches  F <>  q must  He  below  tne  axis,  and 
the  P-arc  of  A which  begins  at  q must  cro.es:  the  axis  (at  cr  to  the  left 
of  the  origin);  in  fact,  we  inay  by  the  propcsi.tion  which  follows 

from  Lemma  6 by  3yirimet;-:-y,  thar.  this  arc  feac.hes  F , Let  it  intersect 
F~  at  the  point  r . The  j.o''‘gth.e  of  tne  N-  arcs  pq  and  rO  wil.l  oc 

denoted  by  1 and  c rese-' rti ■.■ej.y;  tte  time  length  of  the  P-  arc  pq 
will  be  denoted  by  fx  , (N,Bo  Ct  i.',  not  cla.lmed  tnat  rO  is  part  of  A; 
this  is  merely  an  au  xi  i -- ar  y a^c  intcodu-ei  for  conv'-tnience, ) .If,  using 
(29)“ (30),  one  computes  ths;  t'ic”dipaten  0:  q , :•  , and  0 (regarded  as 

the  end  of  the  N-aro  rC  ),  one  :ihta:rs 


2 - ( 2 


where  h ~ -b  + B , C " (b  B):/p  ‘ '‘j..  *■  ‘ • '^^e  resuit  of  differentiating 


both  sides  of  (33)  with  respect  to  tr  , which  may  be  regarded  as  the  inde- 
pendent variable  giving  pqrO  , p being  held  fixed,  is 


d 


(X  ) 


♦ e' 


Hx 


2 - Ce 


2) 


(3li) 


It  will  suffice  to  show  that  this  quantity  is  positive;  for  this  will  in5)ly 

that  X + /i  has  its  minimum  value  when  or  is  as  small  as  possible,  i.e,, 

when  it  vanishes;  but  this  is  exactly  the  case  for  the  path  claimed  to  be 
shortest,  and  for  it  X + /i  is  the  length  of  the  entire  path.  Since,  as 
usual,  ^ > 0 , it  will  in  fact  be  enough  to  prove 

0 < (2  - Ce^^)  < 2 , (35) 

frcm  which  the  positivity  of  the  right  member  of  (3ii)  follows  at  once.  To 
prove  (35),  suppose  first  that  h <0  (b  l)  . The  first  inequality  of 
(35),  since  hX  < 0 , waild  follow  from  C ^ 2 . But  this  is  true;  for 
writing  C out,  transposing  the  2 and  multiplying  both  sides  by  the  posi- 
tive quantity  (b  + 3)  gives  the  equivalent  inequality 

y + (b  + 8)(x  - 1)  ^ 0 , 

ir  r 

which  is  exactly  the  (true)  statement  that  p lies  to  the  left  of  or  on 
.'the  major  P-separatrix.  The  other  inequality  of  (35),  since  hp  < 0 , 
wovild  follow  from  C > 0 which,  in  the  same  way,  is  equivalent  with 

y^  + (b  + 8)(Xp  + 1)  > 0 , 

but  this  is  the  statement  that  p lies  to  the  right  of  the  major  N- 
separatrix,  and  this  is  also  true.  If  h > 0 (b  < -1)  , both  of  the  in- 
equalities in  (35)  are  implied  directly  by  (33). 

B.  Suppose  now  that  p t ^2  * where  this  set  is  taken  to  include 
the  adjoining  sections  of  y = 0 , , and  F , but  to  exclude  K . The 

proof  for  this  case  follows  simply  from  the  preceding  one.  In  A , all 
that  was  really  used  about  p was  that  it  lay  between  the  two  separatrices 


and  to  the  right  of  P . Thus  the  same  proof  applies  verbatim  to  any  point 

of  to  the  left  of  the  major  P-separatrix,  Suppose,  then,  that  p lies 

to  the  right  of  this  line,  by  Lemma  6,  it  is  necessary  to  consider  only 
those  paths  from  p which  begin  with  N-arcs  intersecting  P . Any  such 
path  must  pass  through  the  part  of  R2  already  dealt  with  (it  being  true 
that  P and  Sp  do  not  intersect,  for  they  are  both  P-curves)j  let  p’, 
therefore,  be  any  point  on  the  N-curve  from  p which  lies  in  that  part  of 
to  the  left  of  Sp  } by  what  has  been  shown,  the  shortest  path  from  p’ 
is  of  the  type  claimed;  therefore  the  shortest  path  from  p , consisting  as 
it  does  of  the  N-arc  pp*  and  the  shortest  path  from  p'  , is  again  of  the 
type  claimed. 

C,  Suppose,  finally,  that  p £ + R^  , this  set  being  taken 

open.  By  what  has  already  been  proved,  we  need  to  consider  the  following 
tj>pe  of  path  only;  it  begins  with  an  N-arc  (which,  if  p C R^  , must  cross 

P but  may  otherwise  be  vacuous),  say  pq  , From  q the  path  follows  a 

P-arc  qr  at  least  to  the  axis  y = 0 and  thence,  by  the  results  symmetri- 
cal to  A and  B , to  P~  , which  the  path  then  follows  into  the  origin. 

Thus  we  are  now  considering  paths  pqrO  as  treated  under  A (where  pqrO 

was  not  in  fact  the  true  path  considered),  except  that  p is  now  in  a dif- 

ferent place.  So  (33)  also  holds  here,  but  now  X + ^ + cr  is  the  actual 
length  of  the  path  being  considered,  by  the  usual  argument,  it  will  thus 
suffice  to  prove 


p being  held  fixed.  In  fact,  the  differentiation  of  both  sides  of  (33) 
with  respect  to  X , followed  by  a rearrangement  of  terms,  gives 


i (X  * ^ 


-h^  da  ^ 
dX 

o hX 

2 - Ce 


d<T 

Again,  it  is  evident  that  > D;  thus  the  numerator  is  positive.  That 

the  denominator  is  positive  follows  (when  h < 0 ) from  the  fact  that  p 
lies  to  the  left  of  Sp  or  (when  h >0  ) from  (33),  as  on  page  37. 

This  completes  the  proof  of  Theorem  d. 


VI.  THE  MINIMAL  THEORY:  g(x,y)  = -x  + 2by;  SUMMARY 

The  only  lineax*  case  yet  unexamined  is  that  corresponding  to  equation 
(?'")  of  page  3;  it  arises  when  g(x,y)  = -x  + 2by  , and  represents  the 
physically  improbable  situation  that  there  occurs  not  only  the  usual  velocity 
damping,  but  also  an  "output  dangling"  in  the  direction  opposite  to  that 
which  characterizes  ordinary  (simple  or  damped)  harmonic  motion. 

The  equations  with  which  we  have  to  deal  are  therefore 

dx  dy  + T 

dt  * y > dt  ‘ ^ ^ • 
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The  P-system  corresponding  to  these  equations  consists  of  hvperbola-like 
curves  for  which  the  poiht  (-1,0)  is  a saddle  point,  the  separatrices  being 


the  lines 


y = (-b  - 6)(x  + 1)  , 


6 b + 1 > b 


Note  that  irrespective  of  the  sign  of  b the  two  separatrices  have  slopes 
of  opposite  sign.  In  fact,  the  value  of  b plays  no  role  at  all  in  de- 
termining the  qualitative  behavior  of  the  curves.  This  fact,  together  with 
the  experience  derived  from  the  preceding  sections,  enables  one  to  dispatch 
this  case  rather  swiftly. 

..  sketch  of  the  P-system  is  given  below.  The  N-system  Is  similar, 

only  moved  two  units  to  the  right.  The  P- 
yA.  and  N-curves  are  given  explicitly  by  the  equa- 


tions 


x(t)  = e"^^(Ae®^  + Be’^“)  ^ 1 

y(t)  = -e  [(b  - 6)ae^^  + (b  + 5)Le"^^] 


whure  A and  are  t'iven  bv: 


+ (b  + 6)  [x(0;  - l]  ^ » - = - - d)  [x(vj)  - x]|-. 


We  first  note  that  £ path  from  the  point  p exists  if  and  only  if 
p lies  in  the  strip  bounded  by  the  parallel  lines  y = -(b  + 6)(x  - 1)  . 

To  get  the  necessity  it  will  suffice  to  show  that  no  path  can  cross 
either  of  the  bounding  lines  into  the  strip.  Consider  the  line 
y = -(b  + 6)(x  + 1)  j it  consists  of  two  P-curves,  and  therefore  cannot 
be  crossed  by  any  P'Curve  whatsoever.  Moreover,  by  the  relation  which 
subsists  between  the  slopes  of  the  P-  and  K-curves  at  a point,  it  can  be 
seen  that  N-curves  always  cross  the  line  going  out  of  the  strip.  Thus  no 

path  can  enter  the  strip  along  this  linej  the  same 
holds  for  the  other  bounding  line  by  symmetry. 

That  there  do  exist  paths  from  any  point  within 
the  strip  will  be  seen  below. 

Now  let  r be,  as  before,  that  part  of  the 
N-curve  through  the  origin  which  lies  above  the 
x-axis,  r that  part  of  the  P-curve  through  the 
origin  which  lies  below.  C --  r~  + 0 + F is 
again  a simple  curve  which  divides  the  strip  defined  above  into  an  upper 
and  a lower  part. 


Theorem  When  g(x,y)  - -x  + ?by  , b ^ 0 , the  unique  minimal  path 
from  any  point  p ^ the  strip  bounded  by  the  tv;o  lines  y = -(b  + 6)(x-l) 
gtvsn  ^ follows;  if  p lies  above  C (see  the  preceding  paragraph), 
by  f ollowing  the  N-curve  from  p F and  then  following  F into  the 

origin}  ^ p lies  below  C , symmetrically. 

Proof-  The  proof  Is  much  like  previous  onesj  so  much  so,  in  fact, 
that  it  is  not  worthwhile  to  give  it  in  complete  detail.  The  essential 
things  to  prove  are; 


(i)  Given  any  P-path  A from  a point  p on  the  interval  0 < x <1, 
one  can  find  an  N-path  from  p which  is  shorter  than  A , 


From  this  follows,  as  in  the  derivation  of  Corollary  3,2,  that  for 
any  point  p below  y = 0 and  abcve  F (and,  of  course,  within  the 
strip)  it  is  only  necessary  to  consider  paths  from  p which  begin  with  N- 
arcs  intersecting  F , 


(ii)  The  unique  minimal  path  from  a point  p on  the  interval 
0 < X < 1 is  obtained  by  following  the  N-curve  from  p to  F , and 
then  following  F into  the  origin. 

In  view  of  (i),  to  prove  this  it  will  suffice  to  show  that  any 
path  from  p which  intersects  F with  its  initial  arc,  an  Tl-arc,  is 
longer  than  the  claimed  minimal  path  ;f  it  doesn’t  coincide  with  it. 


Now  let  p be  any  point  below  the  x-axis  and  wit'nln  the  strip, 
and  let  A be  a path  from  it,  A begins  with  an  N-arc  of  length  X 
which,  if  p is  to  the  right  of  F , must  go  as  far  as  F , but  other- 

wise may  be  vacuous.  There  follows  a P-arc  qr  with  q below  the  axis 
which  must  intersect  the  axis  at  exactly  one  point.  This  point  of  inter- 
section, in  fact,  lies  on  the  interval  -1  < x ^ 0 , so  that  the  result 
arising  from  (ii)  by  symmetry  implies  that  we  may  suppose  that  r lies  on 
F . and  that  the  rest  of  the  path  A is  obtained  by  following  F~  from 
r to  the  origin.  Such  a path  is  thus  uniquely  determined  by  X (p  being 

fixed),  and  may  be  written  A (X)  , To  complete  the  proof  for  the  lower 

P 

half -plane  ^the  rest  will  follow  by  symmetry)  it  will  therefore  suffice  to 
prove: 


(iii)  The  length  of 


A (X) 
P 


is  a strictly  increasing  function  of  X , 


Proof  of  (i).  By 


the  usual  reasoning,  one  can  see  that  A must 

begin  with  an  arc  configuration  like  that  shown. 
If  X = i’(pq)  , = T'(qr)  , and  a ---■  r (rs)  , 

then  we  want  to  prove 

Computing  the  coordinates  of  the  corners,  elimi- 
nating superfluous  variables,  and  simplifying, 
one  obtains 


X 

s 


|[(x^  + l)e^^  - 2] 


where  h“6-b>0,  Jf  p and  s are  held  fixed  and  both  sides  of 
this  equation  are  differentiated  with  respect  to  X , one  gets 
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^ (\  + ft  ♦ a ) 


da  ^ T 

dX 


- (x 


-)e 


hX 


Since  >0  , the  nutnerator  is  positive.  The  denominator  Is  also  posi- 

tive, for  if  it  were  not  the  above  eouation  for  x would  make  x > 1 , 

* s 5 

and  this  is  impossible. 

Proof  of  (ii).  Let  ^6  be  any  path  from  p ■; 0 < x < 1 , y ~ 0)  , 

P P 

J^.s  observed,  we  need  consider  A onl;/  if  it  begins  with  an  M -arc  pq 
which  intersects  F , The  point  q is  follcwed  by  a P-arc  qr  which, 
by  (i)  and  sjTnmetry,  may  be  supposed  to  cross  or  end  on  F . If  the  point 
at  which  qr  intersects  F is  v , we  shall  consider  the  "virtral"  path 
pcvO  instead  of  the  true  path  A - por,.,0  ; for  if  it  can  be  shewn  that 
T (pqv)  is  a strictly  increasing  function  of  X ^ r{pq)  (or,  equivalently, 
of  cr  t(vO)),  (ii)  will  follov/  as  in  A,  pages  56-S7, 

Again  computing  the  coordinate"  of  the  corners  q y v , and  0 
(0  Doing  regarded  as  the  ord  of  the  arc  vO  ),  and  so  on,  one  gets 


(x^  - * 2 • f {2 

where  X ^ (pq)  , ft  - T(qv) 
ticn  (3d)>  page  56,  with  C - 1 
which  is  what  is  needed,  it  will 


r (T 


(33 


a’  ^ 


, and  £7  = T (vO)  , This  is  exactly  equa- 

d , 

- X : therefore  to  gat  - — {\  + m ) > 0 
p ^ d or  ^ 

suffice  to  prove; 


■o 


hX  . 
oo  - (x 

P 


.)e 


hX 


? < 2 


The  second  inequality  is  obvious,  since  x < 1 ; the  fir't  follows  from 

P 

v33 ■ ) and  h > 0 , 

Proof  cf  fiii),  A (x)  ir  exac*;ly  like  tne  "virtual"  path  pqvC 

o 

de'jcrib^d  ab^^ve,  except  for  the  faot  that  p no  longer  Jien  on  the 
x-sixir^o  The  equation  corresponding  -o  (3V)  is 


..  h?\ 
^ he 


f '■  ^ fr. 


-ncT 


(33") 


-•  'ny  + X 
F F 


ulffarentlallng  (13")  gives 


-hu  d<r 

e ^ T.-  + 1 
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+ a 


) = 


Ke 


m. 


The  niamerator  is  again  positive;  that  the  denoiriinator  is  positive  follows 
from  (33")  and  h > 0 . This  completes  tne  whole  proof. 

SUMMARY  0?  THE  MINIMAL  THEORY  FOR  g(x,y)  LINEaR 

It  was  pointed  out  on  page  2 that  the  equation  (2)  can  always  be 
^written  in  one  of  the  three  forms  (2')-(2"').  In  terr'S  of  the  problem 
treated  here,  this  means  that  when  g(x,y)  is  linear  one  may  suppose 
that  it  has  one  of  the  three  forms  (i)  g(x,y)  = by  ; (ii)  g(x,y)  = x + 2by  ; 
(iii)  g(x,y)  = -X  + 2by  , where  b is  an  arbitrary  constant.  The  problem 
has  now  been  completely  solved  for  all  three  cases,  and  therefore  for  (2); 
the  solution  for  (i)  is  Theorem  2 (page  26);  for  (ii).  Theorems  7 (page 
5l)  and  8 (page  55);  for  (iii),  Theorem  9 (page  bO).’^  All  these  results 
may  be  summarized  in  the  following  form: 

Theorem  10.  When  g(x,y)  ^ linear,  the  points  from  which  paths 
exist  for  the  corresponding  system 


dx 

dt 


y 


dt  ^ 


- g(x,y) 


(U) 


form  a certain  connected  open  set  5 containing  the  origin.  There  exists 
^ unique  simple  curve  C passing  throi^h  "Uie  origin  wnich  divides  S into 
an  upper  and  a lower  part,  and  such  that  if  (x,y)  = 1 above  C and  on 
the  part  of  C the  right  of  the  origin,  cp  (x,y)  = --i  ^ the  rest  of 
S f then  the  solution  c~..irve  of  (it)  from  arty  point  p £ S .i^  the  unique 
minimal  path  from  p . The  set  S and  the  curve  0 can  be  explicitly 
described  when  g(x-y)  ^ given. 

Briefly,  the  problem  stated  at  the  bctton  of  page  3 has,  when 
is  linear,  a unique  solaticn  which  can  be  explicitly  described.  This  is 
the  central  result  of  this  paper. 


The  physically  important  cases  are  (i)  and  (ii)  with 


VII.  g(x,y)  NONLINEAR 


When  it  becomes  necessary  to  attack  a problem  for  which  g(x,y)  is 
nonlinear,  the  most  one  should  hope  to  be  able  to  carry  over  from  the  last 
three  sections  is  a few  partial  arguments  and  a point  of  view;  for  nearly 
everything  that  has  been  done  for  the  linear  case  depended  rather  heavily 
on  having  explicit  expressions  for  the  P-  and  N-curves,  and  this  is  natu- 
rally out  of  the  question  for  most  nonlinear  equations.  However,  some  of 
the  arguments  which  have  been  used  did  not  really  rely  on  those  explicit 
expressions,  and  may  be  assembled  in  various  ways  to  yield  results  under 
more  liberal  hypotheses.  An  example  of  such  a result  is  given  in  Part  A 
below. 

In  general,  each  nonlinear  problem  is  likely  to  require  a separate 
treatment,  and  usually  a qualitative  and  partial  discussion  is  the  best 
one  can  legitimately  expect.  Such  a discussion  for  the  interesting  case 
in  which  the  F-  and  N-systeras  each  contain  a stable  limit  cycle  of  the  re- 
laxation oscillation  type  is  given  below  in  Part  E, 

A.  g(x,y)  INDEPENDENT  OF  x 

The  results  of  IV  (where  g(x,y)  = by  ) for  b ^ 0 really  depended 
only  on  the  qualitative  behavior  of  the  solutions,  and  can  therefore  be 
generalized;  one  such  generalization  is: 

Theorem  11.  If  (1)  g(x,y)  has  the  following  properties: 

(i)  g(x,y)  ^ independent  of  x ; i.e.,  g(x,y)  = f(y)  ; 

(ii)  f(-y)  = -f(y)  ; 

(iii)  f(-K)  = 1 , f(y)  <1  for  lyl  <K  , K being  some  positive 
constant; 

and  ( 2 ) R ^ the  open  strip  boimded  by  the  two  lines  y = -K  ; then 

there  exists _a  unique  minimal  path  from  any  point  p E R , determined  as 
in  Theorem  2, 

Examples.  Some  functions  satisfying  (i)-(iii)  are;  g(x,y)  = Cy^  , 

C < 0,  n = 1,  3,  5, . . .,  K = - C ; g(x,y)  = -sin(ay)  (K  = n/2a)  . 

Proof . The  assumption  (i)  implies  that  any  ?-curve  can  be  obtained 
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if 


A 


i 
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from  any  other  by  translating  it  along  the  x-axis;  (ii)  gives  the  condition 
for  symmetry,  as  discussed  at  the  end  of  III,  The  crucial  assumption  is 
(iii).  The  P-curve  through  the  origin  (which  will  be  taken  as  a typical 
P-curve)  satisfies  the  equations  ^=y,  ^(y)  • At  the  origin 

the  motion  is  upwards  and,  as  it  leaves  the  origin,  tending  to  the  right. 

By  (ii)  and  (iii),  f(K)  = -1  ; therefore  f(y)  < 1 , and  the  function 
1 ' f (y)  attains  a positive  minimum  € , on  0 $ y < K , Fran  this,  by 

the  second  differential  equation. 


y = [1  - f(y)]  dt  ^ € (t  - t ) (0  $ y ^ K) 

*t 

0 

where  t^  is  the  value  of  the  parameter  representing  the  origin.  But  this 
means  that  for  some  value  of  t (e.g.,  t = t^  + K/e  ) the  curve  will 
have  crossed  the  line  y = K . Thus  the  P-curve  through  the  origin  moves 
monotonely  upwards  and  to  the  right  from  the  origin  until,  after  some 
finite  interval,  it  intersects  y = K , 

By  applying  a similar  argument  to  any  interval  -K  + 6 $ y ^ 0 

(0  < 6 < K),  we  see  that  the  curve  must  cross  each  line  y = -K  + 6 as 

t - GO  and  must  therefore  come  arbitrarily  close  to  the  line  y = -K  , 

But  it  cannot  cross  this  line,  which  is  itself  a P-curve.  Once  the  curve 

has  crossed  the  line  y = -K  + 6 (for  decreasing  t ) it  must  remain 

within  the  strip  -K  < y < -K  +6  , for  ^ = 1 - f(y)  >0  in  this  strip, 
dx 

Moreover,  since  — = y<-K+6<0  in  this  strip,  the  curve  goes  in- 

CIX> 

finitely  far  to  the  right  as  t - oo  . 

Thus  it  stands  verified  that  the  P-curve  through  the  origin,  and 
therefore  all  P-  and  N-curves,  have  exactly  those  properties  of  P-  and 
N-curves  used  in  proving  Theorem  2 for  b < 0 . The  proof  of  that  theorem, 
accordingly,  can  be  applied  directly  here,  and  gives  the  result  stated. 

Corollary  11,1,  The  conclusion  of  Theorem  11  remains  true  if  one 
replaces  (iii)  and  (2)  by; 

(iii')  f(y)  <1  , for  -oo  < y < co  ; 

( 2'  ) R the  entire  plane  . 


1 


Proof.  Everything  ia  as  before,  except  that  now  the  P-curve  through 
the  origin  goes  strongly  to  infinity  in  the  sense  that  if  x(t)  and  y(t) 
represent  this  curve,  x(t)  — ^ co  and  y(t)-»’-oo  as  t-*-“00  , while 
x(t)  — ► CO  and  y(t)  oo  as  t-»-oo.  The  curve  thus  behaves  qualitatively 

like  a parabola  with  its  vertex  at  the  origin  and  y = 0 as  its  axis.  The 
proof  of  Theorem  2 for  b = 0 consequently  works  here. 

Corollary  11.2,  The  conclusion  of  Theorem  11  remains  true  if  one 
replaces  (iii)  jjy: 

(iii")  f(“K)  = 1 and  f(y)  < 1 for  [y|  < -K  , K being  some 
negative  constant. 

Proof.  In  the  same  way  that  Theorem  11  corresponds  to  Theorem  2 
with  b < 0 , this  corresponds  to  Theorem  2 with  b > 0 . There  is  the 
difference  that  Theorem  11  accounts  for  all  the  points  from  which  paths 
exist,  while  for  a function  satisfying  (iii")  there  may  very  well  exist 
points  outside  the  strip  lyl  c -K  from  which  paths  do  exist  but  for  which 
no  minimal  paths  are  described,  (Indeed,  this  happens  when  f(y)  - by  , 
b > 0 , as  we  have  seen.)  The  reason  for  this  incompleteness  is  that  it 
was  at  just  this  stage  in  the  proof  of  Theorem  2 that  the  explicit  formulae 
for  the  P-  and  N-curves  were  used. 


B,  LIMIT  CYCLES 

When  g(x,y)  is  nonlinear,  limit  cycles  can  ocnur  among  the  P-  and 
N-curves.  As  is  well  known?  the  close  study  of  these  limit  cycles 
especially  as  regards  their  exact  quantitative  characteristics  — - presents 
great  difficulties.  The  object  of  this  discussion  will  be  to  show  one  way 
limit  cycles  can  occur  in  a manner  relevant  to  our  problem,  and  then  to  say 
something  about  the  corresponding  choice  of  4>  (x,y). 


We  shall  consider  the  generalized  van  der  Pol  equation 


d T 


(36) 


where  f(x)  £ C^  and  Is  a real,  positive  parameter.  If  G(,x)  is 

defined  by 


then  (upon  putting  t = fir  ) the  equation  (36)  can  be  written 


dx  _ 

at  “ 


G(x) 


- y 


dy  _ 1 

dt  ■ 7T7 


X 


In  this  situation  a theorem  due  to  LaSal3.e  (LaSalle  (1)) -states  that: 


If  there  exist  four  numbers  < a^  < 0 < < ai  such  that 

— 3 h 

(a)  G(a^)  = G(a^)  G(a^)  = G(a^)  , 

(b)  G(a2)  ^ G(x)  ^ G(a^)  for  a^  ^ x ^ a^  , and 

(c)  G'(a^)  <0  and  G' (a|^)  <0  , 

then  for  fi  > /x  ^ > 0 there  exists  a uniqu e stable  limit  cycle  in  a 
certain  neighborhood  of  the  curve  H (see  the  figure) j as  p.  — r~  oo  , 
the  limit  cycle  converges  to  H , 


What  we  are  really  concerned  with  is 
not  (36),  but  rather  the  corresponding 
equations 


d^x 

dr^ 


f (x)  ^ + X = 1 1 
^ dT 


(37) 


Because  of  the  nonlinearity,  the 
existence  of  a periodic  solution  of  (36) 
does  not  guarantee  the  existence  of  a 
periodic  solution  for  either  of  the  equa- 
tions (37).  What  one  would  like  to  see  happen,  however,  is  that  there 
occur  a P-limit  cycle  lying  slightly  to  the  right  of  the  origin  and  an 
N-limit  cycle  slightly  to  the  left,  so  that  each  could  act  as  a "big 
focus"  and  the  curves  F and  F , defined  as  in  the  spiral  case  (see 
page  33),  would  be  spiral  arcs  acting  as  they  did  there."'  One  would  then 


One  could  also  seek  P-  and  N-cycles  ea-'h  just  barely  containing  the 
origin  at  its  left  or  right  extreme  respectively;  this  would  lead  to 
considerations  similar  to  those  which  follow. 
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suppose  that  the  minimal 

r" 


paths  from  points  in  some  neighborhood  of  the 

origin  would  be  obtained  by  taking  (x,y)  = +1 
above  C and  on  V , -1  below  C and  on  F” 
within  the  neighborhood.  Indeed,  the  very  pur- 
pose here  is  to  imitate  the  spiral  case,  having 
limit  cycles  in  place  of  foci. 


c = r~  + 0 + r 


If  we  put  X 


(37)  becomes 


d^u 


+ fj.  f (u  - 1)  ^ + u = 0 


(36-) 


and  this  is  subject  tn  LaSalle's  theorem.  Putting 


G+i(u)  = -J*  f (v  - 1)  dv  , 

o 

one  gets  the  functions  G^^(u)  and  G ^(u)  corresponding  to  G(x)  for 
the  P-  and  N-sys^ems  respectively.  It  is  easy  to  see  that  the  curve  of 
G ^(u)  can  be  obtained  by  moving  that  of  G^^(u)  two  units  to  the  left 
along  y = 0 and  then  raising  or  lowering  it  until  it  again  passes  through 
the  origin. 


It  follows  from  LaSalle's  theorem  that  both  the  equations  (36-)  will 
give  limit  cycles  in  the  u,y-plane  ( y being  as  above,  and  not  identified 
with  ^ as  previously)  as  described  when  is  sufficiently  large  if 

each  of  the  fxmctions  G+^^(u)  satisfies  a set  of  conditions  (a)-(c),  i.e,, 
if  their  common  curve  has  the  general  shape  of  that  in  the  following  figure. 

If  ua  (the  abscissa  of  the  point  a 
y with  respect  to  the  u-axis)  satisfies 


t 


St- 


TA"-! 

A H 


ua  > -1  , then  for  a given  € > 0 and 
/i  sufficiently  large  the  limit  cycle 
corresponding  to 
extreme  point  p 


-1  < u < u^  + € 

P a 

cycle  corresponding  to 


will  have  its  left 
on  the  interval 

Similarly,  the  limit 


H 


-1 


will  have 


its  right  extreme  point  q on  the  inter- 


val V,  - € 


< V < 1 

q 


if 


Vj  < 1 

d 
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The  function  u(t)  which  represents  the  limit  cycle  corresponding  to 
is  thus  a periodic  function  whose  minimum  value  lies  in  the  interval 
(-1,  u + ^ ) . The  corresponding  P-curve  (in  the  original  x,  ~ -jilans)  for 

3 CL  V 

(37)  is  therefore  a limit  cycle  whose  left  extremity  lies  on  the  interval 
(0,  1 + u + « )}  this  point  must  lie  on  the  x-axis  because  ^ = y f 0 off 

the  axis.  This  limit  cycle  is  consequently 
of  the  kind  illustrated,  as  was  desired.  An 
M-lir-it,  cycle  of  the  corresponding  type  on  the 
other  side  of  the  origin  is  obtained  j.n  the 
same  way. 

It  is  not  certain  that  these  limit  cycles 
are  the  only  ones  in  the  P-  and  N-systems^  or 
even  that  they  are  the  only  ones  passing 
through  a prescribed  neighborhood  of  the  origin;  but  when  they  are,  the  P- 
and  N-curves  behave  near  the  origin  in  the  manner  described  on  the  preceding 
page. 

In  order  for  all  this  to  happen,  the  lunction  G^^(u)  must  have  four 
relative  extrema,  and  therefore  f(x)  must  have  at  least  four  zeros  and  be, 
if  a polynomial,  of  degree  four  or  higher.  (A  simple  transcendental  functicn 
which  has  the  necessary  properties  is 

f(x)  = n cos  nx  + cx  , 

where  c is  a constant  which  must  be  properly  chosen;  it  gives 

G^^(u)  = sin  nu  - cu  , 

a function  whose  graph  is  of  the  type  shown.) 

It  was  once  thought  that  the  introauction  of  such  limit  cycles  might 
make  for  better  response  of  the  system  in  the  vicinity  of  the  origin  than 
linear  systems  could  provide;  that  the  presence  of  the  limit  cycles  might 

accelerate  the  m.otion  along  the  P-  and  N-curves  nearby.  But  this  violates 

our  basic  principle  (wnich,  to  be  sure,  has  not  been  rigorously  demonstrated) 
that  the  best  control  should  be  that  which  uses  the  full  magnitude  of  the 
available  control  force  at  all  times;  for  if  a certain  amount  nf  force  is 
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available  to  be  put  into  the  first  order  term  in  (3?),  it  would  then  be 

better  to  take  it  at  its  full  strength  and  combine  it  with  the  control 

force  already  present  (in  the  form  -1)  on  the  right-hand  side,  than  to 

• dx 

use  it  partially  and  continuously  in  the  form  of  the  function  f (x)  . 

However,  if  an  equation  like  (37)  is  dictated  by  circumstances^  and 
the  function  f(x)  cannot  be  altered  at  will,  a discussion  of  the  above 
sort  must  be  undertaken.  In  the  case  which  has  been  considered  the  limit 
cycles  occur  in  what,  from  the  standpoint  of  tractability,  is  probabiy 
the  simplest  way;  but  even  so  it  is  complex  enough.  But  even  when  such  a 
discussion  can  be  carried  out  in  sufficient  detail,  it  is  still  only 
prefatory  to  the  treatment  of  the  problem  actually  at  hand,  that  of  finding 
the  minimal  paths.  Except  for  conjectures  based  on  analogies  with  the 
linear  case,  nothing  whatsoever  has  been  discovered  along  these  lines. 
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APPENDIX  I.  VELOCITY  CONTROL^'' 


In  section  II  a description  was  given  of  the  conclusions  reached  in 
previous  investigations  of  the  behavior  of  the  solutions  of  the  equation 


+ X 


(0  < b < 1) 


(2') 


when  <p  (x,y)  is  of  the  special  type  <p  (x,y)  = sgn(Kx  *■  My)  , K and  M 
being  constants.  The  same  group  of  investigators  has  also  studied  the 
similar  problem  in  which  the  control  force  has  a derivative  like  this;  i.e., 
the  problem  associated  with  the  equation 

2 

i-|+2bg  + x=  ^ (0<b<l)  (3b) 

dt 


where  is  a continuous  function  of  t and,  at  points  where 

Kx  + My  + N\//  / 0,  satisfies 

= sgn(Kx  + My  + Ni|/)  , ^ ^ ^ 

K , M , and  N being  constants. 

In  the  former  problem,  0 along  any  solution  was  a step  function  al- 
ternating between  the  values  -1;  here  ip  along  any  solution  is  a continuous 
’'sawtooth"  function  whose  graph  consists  of  linear  pieces  of  slope  -1,  Such 
a system  (called  velocity  control)  has  the  practical  advantage  that,  without 
being  essentially  more  difficult  to  design,  It  gives  a smoother  operation. 
This,  in  fact,  is  its  principal  virtue;  it  does  not  claim  to  give,  for  a 
bounded  control  force,  a rapidity  of  response  comparable  to  that  of  the  dis- 
continuous, "position  control"  arrangement  which  has  been  discussed.  Still, 
one  might  consider  the  problem  of  replacing  the  function  sgn  ^ Kx 
by  a more  general  function  of  (x,  which  takes  on  only  the  values 

-1  in  such  a way  as  to  obtain  minimal  paths  in  the  ^x,  -space  and 


The  appendices  deal  with  questions  which  lie  outside  the  domain  proper  to 
the  main  text,  but  which  nevertheless  are  of  a kindred  nature  and  should 
be  discussed  for  the  sake  of  completeness. 


thus  the  most  rapid  response  within  the  limits  of  the  system.  This  problem 
has  not  been  touched.  What  has  been  done  (the  most  recent  account  is  in 
Flligge-Lotz  (2),  Chapter  5)  is  to  make  a graphical  study  of  the  solutions 
of  (3b) -(39)  after  the  manner  of  that  described  in  section  II  above.  The 
several  cases  that  occur  corresponding  to  the  different  possible  combina- 
tions of  signs  on  the  constants  K , M , and  N are  given  separate  treat- 
ments. 

The  technique  used  to  implement  the  investigation  is  first  to  intro- 
duce the  auxiliary  variable 

y(t)  = x(t)  - \^/(t) 

and  to  consider  solutions  in  the  ^y,  ^ -space,  where  they  consist 

of  arcs  whose  projecti  nns  in  the  -plane  are  arcs  of  logarithmic 

spirals.  Then  the  solutions  are  considered  in  terms  of  their  projections 
in  this  and  a certain  other  plane,  whereby  all  the  desired  information  can 
be  obtained.  One  again  encounters  end  points,  start  points,  periodic  solu- 
tions, etc.  It  may  also  happen  that  a solution  has  a last  comer  beyond 
which  it  moves  off  to  infinity;  the  purpose  of  introducing  the  term  in  {j/ 
in  the  argument  of  the  signum  function  is  to  avoid  this  phenomenon.  Fur- 
ther details  will  not  be  given  here. 


APPENDn  II.  MULTIPLE  MDDE  CONTROL 


Those  interested  in  the  practical  applications  of  the  theory  for  the 
case  in  which  g(x,y)  is  linear  will  have  noted  that  the  subject  of  time 

lag  was  ignored  in  III-VI.  The  present  appendix  will  have  something  to  say 

about  this  and  related  matters. 

It  is  easy  for  one  to  convince  himself  that  the  systems  of  minimal 
paths  discovered  in  those  sections  (and  summarized  in  Theorem  10.  page  63) 
are  "structurally  stable  with  respect  to  time  lags"  in  the  sense  that  the 
presence  of  a sufficiently  small  time  lag  does  not  affect  the  essential 
over-all  behavior  of  the  solution  curves.  This  follows  from  the  fact  that 
the  time  length  of  the  minimal  path  from  a point  depends  continuously  on 
the  position  of  the  point,  from  the  character  of  C with  relation  to  the 
P-  and  N-Curves,  and  so  on.  In  fact,  the  only  place  where  a small  time  lag 
can  seriously  mar  the  qualitative  situation  is  at  the  origin.  There  a time 

lag  has  the  effect  of  causing  a solution  to  overshoot  the  origin  slightly 

Instead  of  ending  there,  have  a corner,  overshoot  in  tne  reverse  direction, 
have  another  corner,  and  repeat  the  act  indefinitely,  (Cf . page  l6.) 

Perhaps  the  first  means  of  avoiding  this  which  suggests  itself  is  that 
of  slightly  altering  the  curve  C so  that  the  time  lag  is  anticipated  and 
pre-correctedj  but  this  is  not  really  feasible,  for  the  time  lag  is  a com- 
plex and  variable  thing  which  cannot  be  predicted  precisely  or  be  expected 
to  be  always  the  same.  Indeed,  the  assumption  that  a time  lag  is  simply  a 
clean,  sharp  delay  in  the  occurrence  of  a corner  is  itself  a considerable 
distortion  of  the  real  state  of  affairs. 

Far  better  is  the  plan  of  disconnecting  the  control  entirely  as  soon 
as  the  solution  has  entered  a satisfactorily  small  neighborhood  of  the 
origin,  and  replacing  it  there  either  by  a control  of  a different  sort,  or 
by  no  control  at  all,  letting  the  system  run  free. 

If,  for  instance,  the  system  in  question  is  given  by  the  equation 

^ + = , (0<b<l) 

dt'^ 

when  the  control  force  -1  is  removed  the  solution  curve  spirals  into  the 


origin.  (The  very  function  of  the  control  fores  *-l  is  to  accelerate  this 
process.)  If,  therefore,  the  mechanical  system  is  so  designed  that  the 
force  represented  by  -1  is  switched  in  accordance  with  Theorem  10  (page  63 )» 
except  that  in  some  small  neighborhood  of  the  origin  in  the  x,y-plane 

it  is  removed  entirely^  then  a typical  solution  curve  would  be- 
have in  the  following  way;  it  would  move  rapidly  in  from  its  initial  point 
and  then,  having  passed  through  a finite  number  cf  corners  (in  any  but  the 
spiral  case,  at  most  one)  it  would  enter  the  neighborhood  mentioned^  once 
this  had  happened  the  solution  would  spiral  gently  down  to  the  origin.  In 
the  presence  of  a time  lag  the  solution  curve  would  not  behave  differently. 
This  arrangement  has  the  valuable  properties  that; 

1)  it  reduces  any  error,  together  with  its  first  derivative,  almost 
to  zero  very  rapidly,  and  thus  accomplishes  the  purpose  of  the  system 
within  arbitrarily  small  tolerance  limits; 

2)  it  prevents  high-frequency  cs cilia tiens  back  and  forth  around  the 
origin  such  as  would  occur  in  the  presence  of  a time  lag  otheiwise; 

3)  it  prevents  the  system  from  being  hypersensitive  to  trifling  dis- 
turbances from  the  zero -error  state. 

Such  an  arrangement  gives  what  is  called  dual  mode  control;  this  is 
what  arises  in  the  conventional  theory  of  servomechanis.ms  when  the  feedback 
circuit  has  a threshold  of  sensitivity  below  which  it  gives  no  output  at 
all,  (See  for  example  the  Appendix  in  MacColl  (1).)  In  the  context  of  our 
minimal  theory,  its  principal  interest  lies  in  assuring  the  mathematician 
that  conclusions  reached  disregarding  time  lags  need  not  be  utterly  useless 
when  confronted  with  practical  demands,  but  the  problem  of  choosing  the 
"right”  dual  moae  arrangement  in  a particular  situation  is  not  a mathemati- 
cal one. 

In  general,  a multiple  mode  control  might  be  defined  as  a control 
system  in  which  different  kinds  of  control  are  to  be  applied  in  different 
parts  cf  the  phass  plane.  Such  arrangements  are  frequently  desirable  when 
one  or  more  of  t.he  assumptions  underlying  the  problem  stated  in  I fail.  An 
instance  of  this  is  discussed  at  some  length  in  the  next  appendix. 


One  should  perhaps  include  with  multiple  mode  controls  those  subject 
to  special  restraints  of  various  kinds.  I'or  instance,  it  may  happen  that 
physical  limitations  prevent  the  magnitude  of  the  derivative  of  the  con- 
trolled variable  (in  terms  of  the  example  of  page  1,  the  angulax  speed  of 
the  motor)  from  exceeding  a certain  value,  K , so  that  any  solution  must 
be  contained  in  the  horizontal  strip  lyl  $ K in  the  phase  plane.  In  this 
case  there  is  a multiple  mode  control  in  the  sense  that  the  control  is  not 
meaningfully  described  at  all  outside  the  strip;  if  a path  reaches  one  of 
the  boundaries  of  the  strip  (necessarily  from  the  xntsrior)_  it  must  follow 
this  boundary  for  some  distance  in  the  props,  direction  and  then  return  to 
the  interior  and,  eventually,  the  origin.  It  is  easy  to  see  that  for  the 
case  in  waich  g(x,y)  = by  the  curve  C should  be  the  same  (within  the 
strip  tyi  K ) as  if  y were  unconstrained;  what  happens  in  more  com- 
plicated cases,  for  example  the  spiral  case,  is  not  clear. 


APPENDIX  III.  COULOMB  DaMPING"^ 

The  whole  problem  with  which  we  have  dealt  derived  much  of  its  im- 
portance from  the  assumption  that  the  best  behavior  (in  terms  of  response 
time)  should  be  obtained  from  a servomechanism  when  the  full  strength  of 
the  power  source  is  usedj  but  tiiis  "full  strength"  may  vary  for  different 
states  of  the  system,  instead  of  remaining  constant  as  we  have  supposed. 
In  this  appendix  such  a case  will  be  discussed. 

In  terms  of  the  example  of  page  1,  it  may  be  possible  to  apply  to 
the  motor  a second  control  force  in  the  form  of  "Coulomb  damping,"  which 
is  characterized  by  being  constant  in  magnitude  but  opposite  in  sign  to 
the  derivative  of  the  output;  the  equation  for  the  system  could  therefore 
be  written 


d^x 

dt^ 


dx 

„ „ dx\ 

„ . / dx^ 

/ dx\ 

dt 

^ ^ ^ ^’dt ) 

* Idt) 

9 

(itO) 


where  everything  is  as  in  equation  (1),  page  1,  except  for  C , which  is 
some  positive  constant,  and  h (^i^)  » which  takes  on  only  the  va^_ues  +1 

and  0 and  represents  the  instruction  from  the  control  as  to  whether  or  not 
the  Coulomb  danping  is  to  be  applied  at  any  particular  time.  The  ’’ight  hand 
member  of  the  above  equation  can  be  written  simply  F > where  this 

function  assumes  only  the  values 

K , -K  , K - C , -K  - C (for  o) 

K , -K  , K + C , -K  + C ( for  p < o) 

K , -K  (for  ^ - o)  . ^ 


Now  the  natural  question  to  ask  is:  How  should  F be  chosen  so  that  all 

solutions  of  (UO)  will  go  to  the  origin  in  the  x.y,  (=  -plane  in 
minimum  time?  If  the  assumption  mentioned  above  is  valid,  it  should  again 
be  true  that  best  results  can  be  obtained  by  using  only  the  extreme  valuec 
of  F available  at  any  point.  This  is  equivalent  to  saying  that  F 


ihis  problem  was  recently  suggested  to  the  writer  by  Professor  u.L.  ftauch 
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assumes  only  the  values 

K , “(K  + C)  (for  ^ >o)  Sind 

-K , K + C (for  ^ < 0 ) . 

This  problem  is  mathematically  equivalent  with  that  which  arises 
when  it  is  assumed  that  the  Coulomb  damping  is  applied  uninterruptedly, 
i.e,,  that  h = +1  , and  that  C < K j in  this  case  P'  takes  on  only  the 
values^ 

K - C , -K  * C (for  ^ > o)  and 

K ♦ C , -K  ♦ C (for  < 0 ) . 

Both  problems  are  special  cases  of  this  one,  which  is  a generaliza- 
tion of  the  problem  on  page  3; 

For  the  equation 

(Ul) 

with  g given  and  a function  which  is  allowed  to  take  on  only  the 

values  a , -6  (for  ^ ^ ^ » -a  , p ( for  ^ ® “ and  3 

being  positive  constants ) , how  should  4>  ^ chosen  so  that  every  solu- 

cix 

tion  of  (iil)  reaches  the  state  x = 0 , ^ ~ ® iB  least  possible 

time  ? 

The  problem  will  be  discussed  here  for  the  simplest  case,  g = 0 . 
This  gives  the  system,  equivalent  with  (hi), 

if  ■ y > it  “ • ('*2) 

^ dx. 

What  happens  for  ^ ~ 0 turns  out  not  to  matter;  the  solutions  ir.a’'  be 
defined  so  as  to  extend  across  this  axis  as  if  F had  the  same  values 
there  as  in  a contiguous  part  of  the  plane. 


Instead  of  talking  about  P-  and  N-curves,  etc.,  we  now  speak  of  a , 0 , 

-a  , *-3 -curves,  etc.  The  considerations  of  sections  I and  III  carry  over 
in  full,  although  the  definition  of  solution  (cf.  pages  6-7)  becomes  even 
more  awkward;  in  particular.  Theorem  1,  which  states  in  effect  that  one 
needs  to  consider  only  canonical  paths,  still  holds,  where  a canonical 
path  is  now  defined  as  one  which  has  no  (-0) (a) -corners  above  y = 0 and 
no  (0)  (-a) -comers  below.  All  this  holds  for  any  g € C*"  , 

The  a-cuirves  for  (i;2),  in  pararaeter-free  form,  are 

2 

y = 2a(x  + k)  (-oo  < k < c» ) 

and  similarly  for  the  other  three  values  of  <p  , These  are  of  course 
parabolas  with  y = 0 as  their  axis.  If  F is  that  part  of  the  0-curve 
through  the  origin  which  lies  below  the  x-axis,  and  F is  that  part  of 
the  (-0) -curve  through  the  origin  which  lies  above,  then  by  trivial  exten- 
sions of  the  methods  of  IV  (pages  26-26  ) 
one  gets  that  the  unique  minimal  paths  are 
obtained  by  taking  4^  as  shown,  with  4 ^ P 
on  F , = 'S  on  F , a or  0 on 

^ the  negative  half  of  the  x-axis,  4 ~ ~o. 
or  -0  on  the  positive  half.  In  fact,  the 
analogous  results  for  g(x,y)  - by  , b f 0. 
can  also  be  proved  in  the  same  way.  The 
problem  for  g less  simple  has  not  been 
studied. 
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